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IN VITRO PEPTIDE EXPRESSION LIBRARY 

Field of the Invention 

The present invention relates generally to recombinant DNA technology and, 
5 more particularly, to in vitro methods for constructing and screening DNA libraries 
for DNA sequences that encode biologically active molecules. 

Background of the Invention 

Isolating an unknown gene which encodes a desired peptide from a 

10 recombinant DNA library can be a difficult task. The use of hybridisation probes may 
facilitate the process, but their use is generally dependent on knowing at least a 
portion of the sequence of the gene which encodes the protein. When the sequence is 
not known, DNA libraries can be expressed in an expression vector, and antibodies 
have been used to screen plaques or colonies for the desired protein antigen. This 

15 procedure has been useful in screening small libraries, but rarely occurring sequences 
which are represented in less than about 1 in 10 5 clones, as is the case with rarely 
occurring cDNA molecules or synthetic peptides, can be easily missed, making 
screening libraries larger than 10 6 clones at best laborious and difficult. Screening 
larger libraries has required the development of methods designed to address the 

20 isolation of rarely occurring sequences, which are based on the co-selection of 
molecules, along with the DNAs that encode them. In vivo methods have been 
developed to screen large libraries, such as phage display and "peptides on plasmids" 
using lad fusions of peptides. 

Phage display is based on DNA libraries fused to the N-terminal end of 

25 filamentous bacteriophage coat proteins and their expression in a bacterial host 

resulting in the display of foreign peptides on the surface of the phage particle with 
the DNA encoding the fusion protein packaged in the phage particle (Smith G. P., 
1985, Science 228: 1315-1317). Libraries of fusion proteins incorporated into phage, 
can then be selected for binding members against targets of interest (ligands). Bound 

30 phage can then be allowed to reinfect Escherichia coli (E. coli) bacteria and then 

amplified and the selection repeated, resulting in the enrichment of binding members 
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(Parmley, S. F., & Smith, G. P. 1988., Gene 73: 305-318; Barrett R. W. et al, 1992, 
Analytical Biochemistry 204: 357-364 Williamson et al, Proc. Natl. Acad. Sci. USA, 
90: 4141-4145; Marks et al, 1991, J. Mol. Biol. 222: 581-597). 

LacI fusion plasmid display is based on the DNA binding ability of the lac 
5 repressor. Libraries of random peptides are fused to the C-terminal end of the lad 
repressor protein. Linkage of the Lacl-peptide fusion to its encoding DNA occurs via 
the lacO sequences on the plasmid, forming a stable peptide-LacI-peptide complex. 
These complexes are released from their host bacteria by cell lysis, and peptides of 
interest isolated by affinity purification on an immobilised receptor target. The 

10 plasmids thus isolated can then be reintroduced into E. coli by electroporation to 

amplify the selected population for additional rounds of screening (Cull, M. G. et al. 
1992. Proc. Natl. Acad. Sci. U.S.A. 89:1865-1S69). 

These bacterial methods are limited by the size of the library that can be 
created by current methods of introducing DNA into host bacteria, the potential 

1 5 cellular toxicity of the expressed peptides introduced, and by the inability to 

introduce post-translational modifications, or to incorporate novel amino acids into 
the expressed peptide. 

An entirely in vifro ribosome system has been described based on the linkage 
of peptides to the RNA encoding them through the ribosome (WO91/05058). 

20 Ribosome display has also been used for the selection of single-chain Fv antibody 

fragments (scFv) (Matheakis, L. C. et al, 1994 Proc. Natl. Acad. Sci. USA, 91: 9022- 
9026; Hanes, J. & Pluckthun, A. 1997 Proc. Natl Acad. Sci. USA, 94: 4937-4942). 
This method suffers from the lower stability of the RNA genetic material and the 
increased degradation likely under certain selection conditions where RNAse is likely 

25 to be present. 

• The in vitro method described by Griffiths and Tawfik (WO 99/02671 and 
WO 00/40712) addresses some of these concerns by compartmentalizing DNA prior 
to expression of peptides, which then modify the DNA within the compartment. 
Peptides capable of modifications, resulting from enzymatic activity -of interest, are 
30 then selected in a subsequent step. However, no direct selection of peptide binding 
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activity is possible of both peptide and DNA without modification of the DNA 
encoding that peptide, and by releasing the modified DNA from the compartment. 

Another prior art method, covalent display technology, or CDT, is described 
in W098371 86. This method relies on covalent linkage of protein to DNA to retain 
5 the linkage of genotype to phenotype, through the cis action of the crosslinking 

protein. This method teaches that two requirements are needed for successful use of 
this technique. Firstly, proteins are required which interact in vjttro with the DNA 
sequence which encodes them (cis action), and secondly, said proteins must establish 
a covalent linkage to their own DNA template. This method suffers from the fact that 

10 the DNA is chemically modified which can prevent the recovery and identification of 
the binding peptide of interest. 

There remains a need for more versatile in vitro methods of constructing 
peptide libraries in addition to the methods described above, which can allow direct 
selection of binding activity, as well as for enzymatic activity, and that allow efficient 

15 production of complex peptide structures, while still allowing recovery of intact 
genetic material encoding the peptide of interest. 

Summary of the invention 

The present invention therefore provides a method for producing an in vitro 
20 peptide expression library comprising a plurality of peptides, wherein each peptide is 
linked to a DNA construct encoding the peptide, comprising the steps of: 

(a) providing a DNA construct comprising: 

(i) a DNA target sequence; 

(ii) DNA encoding a library member peptide; and 

25 (iii) DNA encoding a peptide capable of non-covalently binding 

directly or indirectly to said DNA target sequence of (ii); 
wherein said DNA construct and encoded protein are selected to have 
cis-activity; 

(b) expressing a plurality of DNA constructs according to (a), wherein 
30 said DNA constructs encode a plurality of library member peptides 
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such that each expressed peptide is non-covalently linked to the DNA 
from which it was produced. 
Also provided is a method for producing an in vitro peptide expression library 
comprising a plurality of peptides, wherein each peptide is linked to the DNA 
5 construct encoding the peptide, comprising the steps of: 

(a) providing a DNA construct comprising: 

(i) DNA encoding a library member peptide; and 

(ii) DNA encoding a peptide capable of non-covalently binding to 
a bifunctional agent; 

10 wherein said DNA construct and encoded protein are selected to have 

cis-activity; 

(b) binding a bifunctional agent or a DNA tag capable of binding a 
bifunctional agent to said DNA construct of (a), wherein said 
bifunctional agent is capable of binding to the peptide encoded by said 

15 DNA of (ii); and 

(c) expressing a plurality of DNA constrcuts according to (b), wherein 
said DNA constructs encode a plurality of library member peptides 
such that each expressed peptide is linked via said bifunctional agent 
to the DNA from which it was produced. 

20 The present invention extends to the peptide libraries produced by such 

methods and to the DNA constructs used in such methods. 

The present invention also provides methods of screening in vitro peptide 
expression libraries of the invention. In one aspect there is provided a method of 
identifying and/or purifying a peptide exhibiting desired properties from an in vitro 

25 peptide expression library produced according to the method of any one of the 

preceding claims, comprising at least the steps of (a) screening said library and (b) 
selecting and isolating the relevant library member. In a second aspect, there is 
provided a method of identifying a specific ligand binding peptide, said method 
comprising at least the steps of (a) screening an in vitro peptide expression library 

30 produced according to the method of the invention with ligand molecules which are 
optionally bound to a solid support; (b) selecting and isolating a library member 
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binding to said target molecule; and (c) isolating the peptide which binds specifically 
to said target molecule. In a third aspect there is provided a method of identifying 
and/or purifying a peptide having the ability to bind a specific DNA target sequence 
comprising at least the steps of (a) providing an in vitro expression library according 
5 to the invention wherein said peptide or protein of (iii) is a library member peptide 
having DNA binding activity and wherein said DNA target sequence of (i) is the 
target sequence of interest; (b) selecting and isolating a library member in which the 
encoded protein binds to said target sequence; (c) isolating the peptide which binds to 
said target sequence. 

10 In addition to isolating and/or identifying specific peptides from the libraries 

of the invention, the screening methods of the invention may be used to isolate and/or 
identify the DNA encoding specific peptides from the library. 

Brief Description of the Figures 

1 5 Figure 1 gives a schematic representation of a method by which a DNA 

construct of the invention may be linked to the peptide that it encodes. 

Figure 2 give a schematic representation of a method of the invention by 
which a DNA binding protein may be converted to a cis-acting DNA binding protein. 
Figure 3 gives a schematic representation of how a target sequence specific 
20 DNA binding protein may be isolated from a library of the invention. 

Figure 4 gives a schematic representation of how a library protein may be 
linked to its coding DNA through cis action and the use of a bi-specific binding 
molecule. 

Figure 5 demonstrates cis activity: 1 : 1 mixture of two different sized input 
25 DNAs (CK-RepA or V5-RepA) selected against either antibody. 1 -Marker DNA; 2- 
PCR amplification after selection on anti-human CK antibody; 3-PCR amplification 
after selection on anti-V5 peptide antibody. 

Figure 6 shows the specificty of anti-V5 antibody binding clones. ELISA 
screening, read at 450nM, of the seven clones (1-7) that show specific binding to 
30 anti-V5 antibody. The bars in group of four represent the ELISA signal of the clones 
screened against from left to right; anti-human kappa region antibody, anti-V5 
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antibody, BSA, and blank. A negative control that neither express CK nor V5 is also 
presented (8). 

Figure 7 shows culture supernatant ELISA OD 450nm signals for peptides 
recovered after 5 rounds of selection against B.globigii spores in Example 4. A. = 

5 clonele; B. = clonelf; C. = clonelg; D. = clone8a; E. = clonelOc; F. = clonelOe; G. 
= negative control. 

Figure 8 shows OD 450nm signals for peptides isolated after 4 rounds of 
selection against anti-V5 antibody in Example 5. A. = P1C12; B. = P2H1; C. = 
P1B5; D. = P2B8. Peptide-phage were tested against anti-V5 and anti-ACTH 

10 peptide antibodies. 

Figure 9 shows OD 450nm signals for synthetic peptides isolated after 4 
rounds of selection against ovalbumin. A. = CI; B. = C4; C. = C6; D. = C8; E. = 
negative control. Peptides were tested against ovalbumin, anti-V5 antibody and 
blocked plate (plastic). 

15 Figure 10. shows PCR recoveries of scFv DNA after selection on BSA or 

BSA-mecoprop. A. Anti-mecoprop scFv selected on BSA, 2.5mM ox-glutathione. B. 
Anti-mecoprop scFv selected on mecoprop-BSA, 2.5mM ox-glutathione. C. Anti- 
mecoprop scFv selected on BSA, no ox-glutathione. D. Anti-mecoprop scFv selected 
on mecoprop-BSA, no ox-glutathione. 

20 

Brief Description of the Sequences 

SEQ ID Nos 1 to 1 1, 19 to 23, 26 to 35 and 45 to 47 show the primers used in 
the Examples. 

SEQ ID NO: 12 shows the sequence of the TAC-MYC-CK-REPA-CIS-ORI 
25 construct, SEQ ID NO: 13 shows the sequence of the TAC-MYC-V5-REPA-CIS- 

ORI construct, SEQ ID NO: 24 shows the sequence of the TAC-V5-REPA-CIS-ORI- 
408 construct and SEQ ID NO: 25 shows the sequence of the TAC-NNB-REPA-CIS- 
ORI-408 construct. 

SEQ ID NO: 14 shows the estrogen receptor target recognition sequence. 
30 SEQ ID Nos 15 and 16 show the DNA and amino acid sequences of the repA 

gene from the Rl plasmid of the incFII incompatibility group. SEQ ID Nos 17 and 
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18 show the sequences of the CIS DNA element and ori sequence form the same 
system. 

SEQ ID Nos 36 to 39 show the sequences of peptides isolated after selection 
in Example 5. SEQ ID Nos 39 to 43 show the sequences of clones isolated in 
5 Example 6. 

Detailed Description of the Invention 

The present invention relates to the construction and screening of a library for 
a nucleotide sequence which encodes a peptide of interest in vitro. The constructs 

10 encoding the peptide of interest are designed such that the expressed peptide shows 
cis activity for the construct. Cis activity is defined as the ability of the peptide to 
bind to the DNA from which the peptide was produced, i.e. from which it was 
transcribed and translated. In vitro expression of the construct results in binding of 
the peptide to the DNA encoding that same peptide molecule by non-covalent 

1 5 interaction. This differs from the teaching of WO 98/371 86, which does not allow for 
the possibility of in vitro non-covalent interaction between protein and the DNA it 
encodes, and indeed specifically excludes such interactions from having any practical 
use for library screening. 

Non-covalent binding refers to an association that may be disrupted by 

20 methods well known to those skilled in the art, such as the addition of an appropriate 
solvent, or a change in ionic conditions, for example, the addition of low pH glycine 
or high pH triethylamine. In the present case, a typical example of hon-covalent 
binding would be the non-covalent interaction between a DNA binding protein and a 
DNA molecule. Conversely, when a covalent linkage is formed between the DNA 

25 and the encoded polypeptide, the displayed peptide or protein will not be released 
from the DNA by ionic conditions and solvents that would disrupt non-covalent 
DNA binding protein:DNA interactions. For example, the bacterial replication 
protein repA binds non-covalently to its target DNA sequence oriR and can be 
released from this target DNA sequence at salt concentrations greater than 0.2M KC1 

30 (Giraldo R. & Diaz R. 1992 J. Mol. Biol. 228: 787-802). This salt concentration 

would not affect a covalent linkage, which would require much harsher conditions to 
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release the covalently bound protein, with the increased risk of damage to the 
recovered DNA. 

The current invention describes cis activity and non-covalent binding which 
allow the encoded peptide or protein to remain associated with the DNA construct 
5 with a half life sufficient to allow individual peptides and the associated DNA 

encoding that peptide with an activity of interest to be separated from the resulting 
mixture of protein DNA complexes. For example, the association between the 
encoded protein and its DNA may have a half life of up to 30 minutes, up to 45 
minutes, up to one hour, up to 2 hours, up to 6 hours or up to 12 hours. The 

10 screening methods of the invention may therefore be carried out immediately after 
construction of the library, or later, for example up to one, up to two, up to six, up to 
twelve hours or up to twenty four hours or more than twenty four hours later. 

Surprisingly, therefore, the invention described herein demonstrates that such 
encoded peptides or proteins can be expressed in vitro and bound to the DNA 

15 encoding that peptide in the presence of other DNA sequences. The invention also 
demonstrates that covalent linkage between protein and DNA is not required to 
maintain such cis activity, and that a non-covalent interaction between DNA and 
binding protein is sufficient to allow selection of peptides in an in vitro expression 
and selection system. 

20 According to the present invention, individual DNA library members, each of 

which encodes a peptide to be expressed in the peptide expression library (library 
member peptide), are placed in a suitable DNA construct. The DNA construct into 
which the DNA library member is placed includes all the sequences necessary to 
allow expression of the library member peptide from the construct and to allow the 

25 peptide encoded by the construct to bind to the DNA construct which encoded it. 
Each peptide in the library will typically comprise a fusion protein comprising the 
library member peptide fused to a peptide involved in binding of the fusion protein to 
the relevant DNA construct. Such fusion proteins may comprise further sequences 
and said library peptide may be joined to said binding peptide via a linker sequence. 

30 . A plurality of such constructs, encoding a plurality of different library 

member peptides form a DNA library of the invention. Expressing such a library of 
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DNA molecules results in the non-covalent binding of individual encoded proteins to 
the DNA which encoded them and from which they have been transcribed and 
translated, in the presence of many other DNA molecules that encode other members 
of the library. The sequence encoding the peptide library member present in a 

5 particular encoded protein will therefore be present in the DNA which is bound to 
that protein. This process therefore links the library member peptide, in a 
biologically active form (usually having a binding activity) to the specific library 
member DNA sequence encoding that peptide, allowing selection of peptides of 
interest, for example due to a particular binding activity, and subsequent isolation 

10 and identification of the DNA encoding that library member peptide. 

For the purposes of the invention a DNA library is therefore a population of 
DNA constructs. Each construct comprises a DNA sequence encoding a peptide to 
be expressed as a library member peptide and each contains appropriate promoter, 
translation start and stop signals. A DNA library of the invention will contain a 

1 5 plurality of such DNA molecules. A plurality of DNA constructs are provided each 
encoding a library member peptide to provide a plurality of different library 
members. Preferably a DNA library will contain at least 10 4 discrete DNA 
molecules. For example, a DNA library may contain more than 10 6 , more than 10 s , 
more than 10 10, more than 10 12 or more than 10 14 discrete DNA molecules. 

20 A peptide expression library is defined as a population of peptide sequences 

expressed from a library of DNA molecules. A peptide expression library of the 
present invention therefore encompasses a library of peptides which are.non- 
covalently bound to the DNA which encoded them. For example, a peptide 
expression library of the present invention may be a library of at least 10 4 discrete 

25 proteins each comprising a library peptide sequence, expressed from a library of 
DNA molecules. A peptide expression library of the invention may be any library 
formed by the expression of a DNA library of the present invention. 

A peptide library member can be defined as an amino acid chain of variable 
composition of at least two amino acids in length, or part or all of a naturally 

30 occurring protein such as an enzyme, a binding molecule such as a receptor or an 
antibody or a fragment thereof. A suitable peptide library member may be a peptide 
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having random amino acid composition. The peptide of variable or random 
composition may be flanked by known amino acid sequences a the N- and/or C- 
terminus to constrain the structure. These known sequences may vary in length. The 
peptide of variable or random composition may be inserted at various positions in a 
5 known protein scaffold, such as a receptor or antibody or other protein or fragment 
thereof. The peptide may be inserted into the same protein scaffold once or more 
than once, for example two or more times. 

A DNA construct according to the present invention may comprise DNA 
encoding a library member peptide and means for the encoded peptide to bind to the 
10 encoding DNA construct. In addition to DNA encoding a library member peptide, a 
suitable DNA construct of the invention comprises at least a DNA target sequence 
and DNA encoding a peptide capable of binding directly or indirectly to the DNA 
target sequence. 

According to the present invention, the DNA construct and the encoded 

15 protein are chosen to have cis-activity. That is, the encoded protein has the ability to 
bind specifically to the DNA molecule which encoded it. For example, cis-activity 
may function to allow the encoded DNA binding peptide to bind specifically (directly 
or indirectly) to the target sequence of the DNA construct which encoded it rather 
than to the target sequence of another DNA construct. 

20 In some cases, cis activity may be provided due to the presence of a DNA 

element that directs cis-activity, i.e. that allows or forces the protein encoded by the 
DNA construct to have cis-activity, and therefore to bind to its encoding sequence. 
In other cases, a separate DNA element per se may not be required where the nature 
of the encoding DNA inherently confers cis activity on the encoded peptide. 

25 A DNA element that directs cis-activity may be provided in the DNA 

construct together with the DNA encoding a peptide that interacts with that cis 
element. For example, in the case of the cis element from the repA system discussed 
below, DNA encoding a fragment of the repA sequence comprising at least 20 amino 
acids from the C terminal of repA may be provided along with the cis DNA element. 

30 It may be possible to confer cis activity upon a DNA binding peptide that is not 

normally cis-acting by including in the DNA construct such a DNA element and any 
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further sequences necessary for its action. For example, DNA encoding a peptide 
that interacts with the cis element used may be included in the DNA construct. 

Alternatively, a peptide that interacts with the cis element may be part of the 
DNA binding peptide. For example, the DNA binding peptide may be repA which 

5 comprises the sequence that interacts with the repA cis element. Alternatively, the 
DNA binding peptide may bind to its encoding DNA in cis without the need for a 
separate cis element. . m 

A suitable DNA element may be any element which allows or directs cis- 
activity. Such a DNA element may act, for example, by interacting with the 

10 machinery involved in translation and transcription of the DNA construct to delay the 
production and release of the encoded peptide. 

Any DNA element which allows the encoded peptide to bind specifically to 
the DNA molecule which encoded it may be used as a DNA element according to the 
present invention. One example of a suitable DNA element is that of the repA-cis 

1 5 system described in more detail below. In that system, RN A polymerase is paused by 
loops in the 5' cis sequence prior to the rho dependent termination of transcription. 
The action of the DNA element therefore allows the encoded binding peptide to bind 
to the DNA target sequence in the construct from which it was produced. 

Preferably, the cis DNA element will be be located 3' in the DNA construct to 

20 the library member peptide and to the peptide or protein capable of binding to the 
DNA target sequence. This means that these sequences may be transcribed and 
translated before the RNA polymerase reaches the cis acting sequence. 

According to the present invention, the binding peptide may be linked to the 
DNA construct directly or indirectly. In the case of direct binding, the binding 

25 peptide binds directly and non-covalently to the DNA target sequence. In the case of 
indirect binding, the link between the binding peptide and DNA construct is provided 
by a further molecule. Such a molecule, for example a bifunctional agent as 
described below, will associate with both the peptide and the DNA target sequence. 
A suitable DNA construct may comprise further sequences, for example 

30 suitable promoter sequences to allow expression of the encoded peptide. 
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One example of a system in which cis-activity exists is the a cis acting 
incompatibility group plasmid replication protein, termed repA, system. Aspects of 
this system may be utilised in the present invention as explained below. 

Numerous plasmids include sequences encoding rep A and cis DNA elements. 
5 The repA sequence and cis DNA element present in a DNA construct of the 
invention may be derived from the same plasmid strain or may be derived from 
different plasmid strains. 

It is believed that the repA-cis system acts as shown in Figure 1 . Briefly, RNA 
polymerase is paused by loops in the 5' -CIS sequence prior to rho dependent 
10 termination of transcription. This allows transient C-terminal repA peptide 

interaction with CIS, and possibly DNA bending. RepA peptide then binds to ori, 
which is a defined distance away from the terminal amino acid of the repA coding 
sequence (Prazkier et al. 2000 J. Bacteriology 182: 3972-3980; Praszkier and Pittard 
1999 J. BacterioL 181: 2765-2772; Masai and Arai. 1988 Nucleic Acids Res. 16: 
15 6493-6514). 

The compatibility of a RepA sequence from a plasmid with a cis sequence 
from another plasmid can be readily determined by monitoring for the interaction of 
RepA with the selected cis sequence. 

Suitable repA proteins and sequences and cis DNA elements include those of 

20 the IncI complex plasmids or the IncF, IncB, IncK, IncZ and IhcL/M plasmids, which 
are distantly related at the DNA level, but which control plasmid replication through 
the action of the cis acting repA protein (Nikoletti et al. 1986 J. BacterioL 170:131 1- 
1318; Prazkier J. et al. 1991 J. BacterioL 173: 2393-2397). Specific plasmids which 
may be used to provide these sequences include the Rl plasmid of the IncII 

25 incompatibility group and the incB plasmid pMU720 (described by Praskier J. & 

Pittard J. 1999 Role of CIS in replication of an IncB plasmid. J. BacterioL 181: 2765- 
2772). The DNA and amino acid sequences of repA derived from the Rl plasmid of 
IncII are given in SEQ ID Nos: 15 and 16. The DNA sequence of the cis DNA 
element from the Rl plasmid of IncII is given in SEQ ID NO: 17. Typically, the cis 

30 element is 150 to 200 nucleotides in length. Shorter or larger sequences may be 

used, so long as the sequence maintains the ability to interact with RepA and display 
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cis activity. Minor variations, such as substitutions or deletions within the cis 
sequence are also contemplated such as modifications at 1, 5, 10 up to 20 nucleotides 
within the wildtype cis sequence. 

The cis element is required for cis activity of the repA protein (Praszkier and 
5 Pittard 1999 J. Bacteriol. 181: 2765-2772). The cis DNA element should therefore 
also be located 3' in the DNA construct to the DNA encoding the repA sequence. On 
reaching the cis sequence, the RNA polymerase will be paused, allowing the encoded 
protein to bind the DNA target sequence. 

In one embodiment of the present invention, the DNA binding protein itself 
10 comprises RepA or a fragment or variant thereof capable of DNA binding, including 
at least the 20 C-terminal amino acids of RepA capable of binding to the cis DNA 
element. In this embodiment, the DNA target sequence comprises an ori sequence, 
for example the oriR sequence. In alternative aspects of the present invention, the 
DNA binding protein is provided by an alternative protein with the relevant DNA 
15 target sequences recognised by such binding protein incorporated into the sequence. 
In each of these embodiments, DNA-protein binding is direct in that the peptide 
encoded by the DNA construct will bind directly to the encoding DNA construct. In 
alternative aspects of the invention, as described in more detail below, the DNA- 
protein binding may be indirect through the use of a peptide tag-DNA tag, 
20 Afunctional agent and/or suitable linkers. 

In one aspect, the same sequence may therefore provide both the peptide 
capable of binding the DNA target sequence and the C terminal amino acids of repA. 
Such a sequence may be or may comprise a complete repA sequence, or a fragment 
or variant thereof of a repA sequence which retains the ability to bind to the DNA 
25 target sequence used. Where the repA acts as a DNA binding protein, both cis and 
ori sequences (Praszkier and Pittard 1999 J. Bacteriol. 181: 2765-2772) are required 
for cis activity (cis) and DNA binding (ori). In this aspect, therefore, the DNA target 
sequence is an ori sequence and the peptide or protein capable of binding said target 
is a rep A protein. The position of ori in the DNA constructs of the invention may be 
30 varied. As described earlier, suitable repA, cis and ori sequences may be provided by 
one or more plasmids. For example, suitable sequences may be provided from the 
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IncI complex plasmids or the IncF, IncB, IncK, IncZ and IncL/M plasmids. The 
DNA sequence of the ori from the Rl plasmid of IncII is given in SEQ ID NO: 18. 
This sequence, or a fragment thereof may be included in a DNA construct of the 
invention. A DNA construct of the invention may include a complete ori sequence or 
5 may include a fragment thereof which is capable of being bound by the rep A protein 
being used. 

The RepA protein used in accordance with the present invention may also 
comprise a fragment or variant of RepA, so long as such variant or fragment of RepA 
maintains the ability to bind to the selected ori sequence. Such variant or fragment of 

10 RepA may include substitutions, for example, at 1, 2, 3 up to 20 amino acids within 
the RepA sequence so long as such variants maintain the ability to bind to the ori 
sequence. A suitable fragment of RepA is an ori binding sequence of RepA. Ori 
sequences include those which are present in wild type plasmids as described above. 
Typically, such an ori sequence is 170 to 220 nucleotides in length. Fragments and 

1 5 variants of wild type ori sequences may also be used, so long as such ori sequences 
maintain the ability to be recognised by RepA. Further cis acting members of the 
RepA protein family can be used. For example, the RepA family of proteins is found 
on plasmids with a broad host range i.e. one RepA plasmid may be found in different 
bacterial species. Isolation of a repA family plasmid from (for example) a 

20 thermophilic, sulfophilic, halophilic or acidophilic bacterium, would provide repA- 
cis-ori DNA that could be used under the current invention at elevated temperatures 
or extremes of salt, pH or sulphur concentrations. Such members of the RepA family 
would be advantageous in isolating library members that can bind to target molecules 
under such extreme conditions. Suitable ori sequences for use in combination with 

25 selected RepA proteins can readily be determined by monitoring for the interaction of 
RepA with such an ori sequence. 

The basic principle of the invention may therefore be described with reference 
to the repA/cis/ori system, as shown in Figure 1 . This shows an example of a DNA 
construct of the invention. This construct comprises, from 5' to 3', a promoter 

30 sequence, a sequence encoding a library member peptide, a sequence encoding a 

repA protein, a cis DNA element and an ori sequence. Briefly, the DNA sequence is 
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transcribed from the promoter by RNA polymerase to RNA. The rho dependent 
termination function present in the cis DNA element causes the RNA polymerase to 
pause at this part of the sequence. This allows the repA protein and the library 
peptide to be translated. The rep A protein is then able to bind to the ori sequence, 

5 linking the encoded protein to the encoding DNA construct. 

In one preferred embodiment, library member DNA sequence(s) are fused to 
the repA, cis and ori DNA of the IncFII plasmid Rl (Masai H et al. 1983 Proc Natl 
Acad Sci USA 80: 6814-6818). In this embodiment, the library member DNA 
sequence(s) of interest may be joined by a region of DNA encoding a flexible amino 

10 acid linker, to the 5 9 -end of the rep A DNA, under the control of an appropriate 
promoter and translation sequences for in vitro transcription/translation. Many 
suitable promoters are known to those skilled in the art, such as the araB, tac 
promoter or the T7, T3 or SP6 promoters, amongst others. The promoter should be 
upstream of the polypeptide sequence to be expressed. 

15 The repA family of proteins is used herein by way of example, not limitation. 

Other unrelated non-covalently binding cis acting DNA binding proteins could be 
used in this invention. 

In a further embodiment, non-cis acting DNA binding proteins may be 
converted to having cis-activity (see Figure 2). This may be achieved by using such 

20 proteins, capable of binding the DNA target sequence, either directly or indirectly, in 
combination with sequences which can confer cis-activity upon them. Cis activity 
may be conferred on a binding protein that does not normally act in cis by including 
in the DNA construct a DNA element that directs cis-activity such as the cis element 
of the rep A system. Such an element may be included to ensure that the DNA 

25 binding by the DNA binding protein is cis, that is, an encoded DNA binding protein 
will bind to the DNA construct from which it has been transcribed and translated. 

In one embodiment, a suitable DNA construct may therefore comprise the 
DNA element that directs cis-activity (the cis DNA element) from the repA system. 
Such an element may further comprise DNA encoding a portion of the C-terminal 

30 end of RepA, preferably at least 20 amino acids, more preferably 30 amino acids, up 
to 40, 50, 60 or 70 amino acids from the C-terminal portion of rep A, wherein said 
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fragment of rep A is capable of interacting with the DNA element within the 
construct. In a further example, proteins such as the cis acting transposases, Tn5 and 
IS903, amongst others, could be used under the current invention (McFall E. J 
Bacteriol. 1986 Aug 167:429-432; Derbyshire KM & Grindley ND. Mol Microbiol. 
5 1996 Sep 1 : 1261-72.). DNA encoding sequences of the present invention may 
comprise wild type sequences encoding the desired fragment of RepA, degenerate 
sequences encoding fragments of wild type RepA or sequences encoding variants of 
such fragments of RepA which maintain the ability to interact with the cis element 
incorporated into the DNA construct. Such variants may include substitution of 1, 2, 
10 3 or 4 amino acids within the 20 amino acid C-terminal of RepA. 

The repA family of proteins is used herein by way of example, not limitation. 
Aiiy DNA element capable of conferring cis-activity on a non-cis acting protein 
could be used . 

Any non-cis acting protein may be converted in this way. By way of example, 

15 not exclusion, the estrogen receptor DNA binding domain (DBD) can be converted 
into a cis acting DNA binding protein. The oestrogen receptor DNA binding domain 
fragment (amino acids 176-282) has been expressed in E. coli and shown to bind to 
the specific double stranded DNA oestrogen receptor target HRE nucleotide 
sequence, with a similar affinity (0.5nM) to the parent molecule (Murdoch et al. 

20 1990, Biochemistry 29: 8377-83S5; Mader et al, 1993, DNAs Research 21 : 1 125- 
1 1 32). In one embodiment, the DNA encoding this sequence is fused, preferably at 
the 3' -end, to the DNA encoding at least the last 20 amino acids of rep A, the cis 
DNA element, and the DNA up to the ori sequence followed by the estrogen receptor 
target recognition sequence (S'-TCAGG TCAGA GTGAC CTGAG CTAAA 

25 ATAAC ACATT CAG-3', SEQ ID NO: 14) which replaces the repA ori recognition 
sequence. The DNA sequence(s) of interest may then be joined by a region of DNA 
encoding a flexible amino acid linker, to the 5'-end of to the estrogen receptor DNA 
fragment,, under the control of an appropriate promoter and translation sequences for 
in vitj'o transcription/translation. Expression of this polypeptide directs the estrogen 

30 receptor DBD to its target sequence, present in place of the normal ori sequence, on 
the DNA encoding that polypeptide. Protein-DNA complexes can then be isolated by 
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capture on a target protein. Unbound protein-DNA complexes can be washed away, 
allowing enrichment for DNA encoding peptides or proteins of interest, which can 
then be recovered by PCR, and enriched further by performing several further cycles 
of in vifro expression and protein-DNA complex capture using methods described 
5 previously. 

It will be clear that this approach will apply to other DNA binding proteins 
simply by using the cis DNA element and a sequence encoding at least the C- 
terminal 20 amino acids of repA, or equivalent elements from a different cis-acting 
system in the DNA constructs. 

10 In another embodiment, libraries of randomized DNA binding proteins, such 

as zinc finger proteins, helix-loop-helix proteins or helix-turn-helix proteins by way 
of example, may be screened for specific binding to a target sequence of interest (see 
Figure 3). In this embodiment, the ori recognition sequence of repA may be replaced 
by a target sequence of interest, and the majority of the repA coding sequence by a 

15 library of randomised zinc finger proteins. The DNA binding proteins therefore act as 
both the library member peptides and the proteins capable of binding the DNA target 
sequence in this aspect. The DNA encoding each zinc finger protein, may 
additionally be joined, at the 5'-end, to a peptide tag sequence which can be 
recognized by an another capture protein such as an antibody, and at the 3 '-end, to 

20 the DNA encoding at least the last 20 amino acids of repA, the cis DNA element, and 
the DNA up to the ori sequence followed by the target sequence of interest. 
Expression of this polypeptide directs the zinc finger protein to the target sequence of 
interest, present in place of the normal ori sequence, on the DNA encoding that 
polypeptide. Binding to the target sequence will only occur if the randomised zinc 

25 finger domain is capable of binding to the sequence of interest. Protein-DNA 

complexes can then be isolated by capture with a binding protein which recognizes 
the peptide tag at the N-terminus of the fusion protein polypeptide. Unbound DNA 
can be washed away, allowing enrichment for DNA encoding zinc finger proteins 
capable of binding the target sequence, which can then be recovered by PCR, and 

30 enriched further by performing several further cycles of in vifro expression and 
protein-DNA complex capture. 
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As explained above, the binding peptide may bind directly to the DNA target 
sequence, for example in the case of a DNA binding protein-target sequence pair, or 
it may bind indirectly to the DNA target sequence, for example via a bifunctional 
agent and optionally a DNA tag (see Figure 4): 

5 In one embodiment, DNA encoding a peptide tag which is not able to bind 

directly to the DNA target sequence is joined to the 5 '-end of library member DNA 
sequence(s) of interest, optionally by a region of DNA encoding a flexible amino acid 
linker, under the control of an appropriate promoter and translation sequences for in 
vitro transcription/translation. This forms the DNA encoding the binding peptide, as 

10 the encoded peptide is linked indirectly to the DNA target sequence. Optionally at the 
3 '-end of the library member DNA sequence is the DNA encoding at least the last 20 
amino acids of repA and the cis DNA element, but not the ori target sequence of 
repA. The DNA target sequence may be or may comprise a DNA tag. Such a DNA 
tag may be a single modified base. For example, when preparing the library DNA 

15 construct containing the elements described, the DNA may be tagged at the 3 '-end 
with, by way of example not limitation, molecules such as fluorescein or biotin. 

Prior to in vitro expression, the library DNA fragments may be mixed with a . 
bifunctional agent, one function of which is to recognize and bind to the target 
sequence which may be at the 5' end of the DNA, in a ratio of one DNA fragment: 

20 one bifunctional molecule. The other functional element of this bifunctional agent is 
a binding agent that can recognize and bind to the peptide tag which may be encoded 
at the 5 '-end of the DNA fragment. By way of example not exclusion, the 
bifunctional agent can be composed of an Fab fragment recognizing the fluorescein 
or biotin tag on the DNA, and another Fab fragment recognizing the peptide tag 

25 encoded in the DNA. It is clear to those skilled in the art that this bifunctional agent 
can be made by many different methods such as chemically cross-linking the two 
elements, or by expressing the two elements as a fusion protein, or as a bi-specific 
antibody. Said methods of creating a bifunctional agent are given by way of example 
not exclusion. 

30 The bifunctional agent may be bound to the DNA construct prior to 

expression of the encoded peptide or may be provided during expression. 
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The fusion protein is then transcribed and translated from the DNA construct 
while bound to the bifunctional agent. The peptide tag is translated first, and can be 
bound by the second element of the bifunctional agent, prior to release of messenger 
RNA or RNA polymerase from the DNA. This creates a functional protein-DNA 
5 complex where both expressed polypeptide and DNA encoding that peptide are 
linked through the bifunctional agent. The peptide tag molecule is therefore linked 
indirectly, but specifically, to the DNA target (tag). By linking the protein to the 
DNA construct in this way, it is possible to screen for a protein having particular 
properties, as described below, and then to identify the encoding DNA which is 

10 linked to that protein. By using a bifunctional agent rather than covalent binding 

between the protein and DNA, the DNA construct may be more easily separated form 
the protein without the risk of damaging the DNA. 

Protein-DNA complexes can then be isolated by capture of a target protein. 
Unbound protein-DNA complexes can be washed away, allowing enrichment for 

1 5 DNA encoding peptides or proteins of interest, which can then be recovered by PCR, 
and enriched further by performing several further cycles of in vitro expression and 
protein-DNA complex capture using methods described previously. 

Additionally, under this embodiment, the DNA can be bound directly, for 
example by covalent binding, to a bifunctional agent such as a polymer. Such a 

20 polymer can contain more than one binding element that could recognise the peptide 
tag, allowing multivalent display of a peptide expression library molecule in a unit 
containing the DNA encoding the displayed peptide. By way of example, not 
limitation, said polymers can be composed of polyethylene as well as other polymeric 
compounds, capable of being fused to DNA. The DNA construct of the invention 

25 may therefore be provided bound to such a bifunctional agent, or bound to a DNA tag 
as decsribed above which is capable of being bound by such a bifunctional agent. 

In all embodiments of the invention, the DNA constructs include appropriate 
promoter and translation sequences for in vitro transcription/translation. Any suitable 
promoter can be used, such as the ara B, tac promoter, T7, T3 or SP6 promoters 

30 amongst others. The promoter is placed so that it is operably linked to the DNA 
sequences of the invention such that such sequences are expressed. 
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The DNA encoding the library member peptides may be produced by any 
sourcible means. In particular, such DNA may comprise DNA isolated from cDNA, 
obtained by DNA shuffling, and synthetic DNA. 

The DNA construct may also encode amino acid linkers within the expressed 
5 fusion protein. In particular, a flexible amino acid linker may be included to join the 
DNA binding peptide/RepA to the library member peptide. 

According to the invention, with reference to this preferred embodiment, 
peptide or protein expression libraries, linked to the DNA encoding them, can be 
generated and peptides with the desired activity selected by the following steps: 

10 . 

Constructing a library of fusion proteins. 

A DNA library of peptides or proteins may be fused to DNA encoding a 
peptide capable of binding to the DNA target sequence, such as a cis acting DNA 
binding protein DNA, by a region of DNA encoding a flexible amino acid linker, 

15 under the control of an appropriate promoter and with a translation, or ribosome 

binding site, start and stop codons, in a manner suitable for in vitro expression of the 
peptide library members and binding proteins. In the example of the repA protein, 
the DNA (such as DNA) library members are fused to the repA DNA binding protein 
DNA, or a fragment thereof The cis and ori sequences may be included in the 

20 construct downstream of the other elements. In the case of a DNA library, said DNA 
constructs are designed to be suitable for in vitro transcription and translation. 

Expression and cis binding of DNA library fusion proteins. 

In order to allow cis activity, a coupled bacterial transcription/translation 

25 environment such as the S30 extract system (Zubay, G. 1973. Ann. Rev. Genet. 7: 
267) may be used. Expression of the peptide, such as the DNA library member 
peptide-repA fusion protein, in this environment, will result in binding of the fusion 
protein to the DNA encoding that fusion protein, provided that both cis and ori 
sequences are present. When libraries of peptide-repA fusion proteins are expressed 

30 in this manner, this process results in the production of libraries of protein-DNA 
complexes where the protein attached to the DNA is encoded by that fragment of 
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DNA from which it was expressed, thereby allowing subsequent selection of both 
peptides or protein of interest, and the DNA encoding said peptides. The complexity 
of these libraries is enhanced by the in vitro nature of the method, libraries of at least 
10 lo -10 14 DNA fragments, if not even larger libraries, can easily be generated. 
5 Compounds that prevent nuclease activity, or reduce non-specific DNA- 

protein or protein-protein interactions may be added during this 
transcription/translation reaction and cis-binding. Examples of suitable compounds 
include detergents and blocking proteins such as bovine serum albumin (BSA). 

1 0 Selection of the peptide of interest. 

An in vitro peptide expression library produced by a method of the present 
invention may be used to screen for particular members of the library. For example, 
the library may be screened for peptides with a particular activity or a particular 
binding affinity. Protein-DNA complexes of interest may be selected from a library 

1 5 by, for example, affinity or activity enrichment techniques. This can be accomplished 
by means of a ligand specific for the protein of interest, such as an antigen if the 
protein of interest is an antibody. The ligand may be presented on a solid surface 
such as the surface of an ELISA plate well, or in solution, for example, with 
biotinylated ligand followed by capture onto a streptavidin coated surface or 

20 magnetic beads, after a library of protein-DNA complexes had been incubated with 
the ligand to allow ligand-ligand interaction. Following either solid phase or in 
solution incubation, unbound complexes are removed by washing, and bound 
complexes isolated by disrupting ligand-ligand interactions by altering pH in the 
well, or by other methods known to those skilled in the art such as protease digestion, 

25 or by releasing the DNA directly from the complexes by heating or phenol- 
chloroform extraction to denature the repA-ori DNA binding. DNA can also be 
released by one of the methods above, directly into PGR buffer, and amplified. 
Alternatively, DNA may be PCR amplified directly .without release from the 
complexes. Optionally, DNA not bound by the binding for example repA protein, 

30 can be protected from degradation by non-specific DNA binding proteins such as 
histones, by way of example. It will be clear to one skilled in the art that many other 
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non-specific DNA binding proteins could be used for this purpose. Further, 
compounds that prevent nuclease activity, or reduce non-specific DNA-protein or 
protein-protein interactions may be present during the selection process. Examples 
of suitable compounds include detergents, blocking proteins such as found in milk 
5 powder or bovine serum albumin (BS A), heparin or aurintricarboxylic acid. 

Recovering bound complexes, reamplifying the bound DNA, and repeating 
the selection procedure provides an enrichment of clones encoding the desired 
sequences^ which may then be isolated for sequencing, further cloning and/or 
expression. For example, the DNA encoding the peptide of interest may be isolated 

10 and amplified by, for example PCR. In one embodiment, repeated rounds of 

selection and DNA recovery may be facilitated by the use of sequential nesting of 
PCR primers. DNA ends are generally damaged after multiple PCR steps. To 
recover DNA from such damaged molecules required the primers to be annealed 
away from the ends of the DNA construct, thereby sequentially shortening the 

1 5 construct with every round of selection. 

In one aspect, the DNA construct and/or the encoded protein may be 
configured to include a tag. Such a peptide or DNA tag, for example as described 
above, may be used in the separation and isolation of a library member of interest. 
Such a tag may also be used to hold the library members, for example on a solid 

20 support for use in the screening methods described herein. 

It can therefore be seen that the screening methods of the present invention 
may include the further step of selecting and isolating the relevant library member 
peptide, allowing the peptide exhibiting the desired, properties, and also the DNA 
encoding that peptide, to be identified and purified. 

25 The invention therefore encompasses peptides and DNAs that have been 

identified by a method of the invention. These peptides and DNAs may be isolated 
and/or purified. The peptides or DNAs isolated by a method of the invention may be 
modified, for example by deletion, addition or substitution of amino acids or 
nucleotides. Suitable modified peptides or DNAs may show at least 50%, at least 

30 75%, at least 90%, at least 95% or more amino acid or nucleotide sequence identity 
to the peptide or DNA isolated by the method of the invention. Peptides identified by 



WO 2004/022746 




CT/GB2003/003860 



23 



10 



15 



20 



25 



a method of the invention may be modified for delivery and/or stability purposes. 
For example, such peptides may be pegylated (attached to polyethylene glycol) to 
prolong serum half life or to prevent protease attack. Peptides identified by a method 
of the invention may be modified in other display systems such as phage display or 
by synthesising and screening peptide variants. A collection of such modified 
sequences may form a new library which may be incorporated into constructs of the 
invention and further screened to find, for example, a variant sequence showing 
improved binding to a particular ligand. Thus in one embodiment, a library of 
peptides for use in the methods of the invention may be a library of structurally 
related peptides. 

Alternatively, a library of essentially random peptide sequences may be used. 
Numerous types of libraries of peptides fused to the cis acting DNA-binding protein 
can be screened under this embodiment including: 

(i) Random peptide sequences encoded by synthetic DNA of variable length. 

(ii) Antibodies or antibody fragments, for example single-chain Fv antibody 
fragments. These consist of the antibody heavy and light chain variable region 
domains joined by a flexible linker peptide to create a single-chain antigen binding 
molecule. 

(iii) Random cDNA fragments of naturally occurring proteins isolated from a 
cell population containing an activity of interest. 

(iv) Random peptide sequences inserted into, or replacing a region of a known 
protein, whereby the known protein sequence acts as a scaffold, which constrains the 
random peptide sequence. Many such scaffolds have been described, by way of 
example, not exclusion, CTLA-4 (WO 00/60070), has been used as a scaffold for 
peptide libraries. 

In another embodiment the invention concerns methods for screening a DNA 
library whose members require more than one chain for activity, as required by, for 
example, antibody Fab fragments for ligand binding. In this embodiment heavy or 
light chain antibody DNA is joined to a nucleotide sequence encoding a DNA 
binding domain of, for example, repA. Typically the unknown antibody DNA library 
sequences for either the heavy (VH and CHI) or light chain (VL and CL) genes are 
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inserted in the 5 f region of the repA DNA, behind an appropriate promoter and 
translation sequences. Thus, rep A fused to aDNA library member-encoded protein is 
produced bound to the DNA encoding that protein. The second known chain, 
encoding either light or heavy chain protein, is expressed separately either: 
5 (a) from the same DNA fragment containing the repA and the first 

polypeptide fusion protein library, or 

(b) from a separate fragment of DNA present in the in vitro 
transcription/translation reaction. 

The known chain associates with the library of unknown fusion proteins that 
10 are fused to the rep A protein and thereby bound to the DNA for the unknown chain. 
The functional Fab library can then be selected by means of a ligand specific for the 
antibody. 

The DNA identified by a screening method of the invention, e.g. the DNA 
encoding the selected library member peptide, may be cloned into a vector. In one 

15 embodiment, the DNA identified by a method of the invention is operably linked to a 
control sequence which is capable of providing for the expression of the coding 
sequence by the host cell, i.e. the vector is an expression vector. The term "operably 
linked" refers to a juxtaposition wherein the components described are in a 
relationship permitting them to function in their intended manner. A regulatory 

20 sequence, such as a promoter, "operably linked" to a coding sequence is positioned in 
such a way that expression of the coding sequence is achieved under conditions 
compatible with the regulatory sequence. 

Such expression vectors are routinely constructed in the art of molecular 
biology and may for example involve the use of plasmid DNA and appropriate 

25 initiators, promoters, enhancers and other elements, such as for example 

polyadenylation signals which may be necessary, and which are positioned in the 
correct orientation, in order to allow for protein expression. Other suitable vectors 
would be apparent to persons skilled in the art. By way of further example in this 
regard we refer to Sambrook et al 1989. 

30 The vectors may be for example, plasmid, virus or phage vectors provided 

with a origin of replication, optionally a promoter for the expression of the said DNA 
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and optionally a regulator of the promoter. The vectors may contain one or more 
selectable marker genes, for example an ampicillin insistence gene in the case of a 
bacterial plasmid or a resistance gene for a fungal vector. Vectors may be used in 
vitro, for example for the production of DNA or RNA or used to transfect or 

5 transform a host cell, for example, a mammalian host cell. The vectors may also be 
adapted to be used in vivo, for example in a method of gene therapy. 

Promoters and other expression regulation signals may be selected to be 
compatible with the host cell for which expression is designed. For example, yeast 
promoters include S. cerevisiae GAL4 and ADH promoters, S. pombe nmt\ and adh 

10 promoter. Mammalian promoters include the metallothionein promoter which can be 
induced in response to heavy metals such as cadmium. Viral promoters such as the 
SV40 large T antigen promoter or adenovirus promoters may also be used. All these 
promoters are readily available in the art. 

Mammalian promoters, such as P-actin promoters, may be used. Tissue- 

1 5 specific promoters are especially preferred. Viral promoters may also be used, for 
example the Moloney murine leukaemia virus long terminal repeat (MMLV LTR), 
the rous sarcoma virus (RS V) LTR promoter, the S V40 promoter, the human 
cytomegalovirus (CMV) IE promoter, adenovirus, HSV promoters (such as the HSV 
IE promoters), or HPV promoters, particularly the HPV upstream regulatory region 

20 (URR). Viral promoters are readily available in the art. 

The vector may further include sequences flanking the polynucleotide of 
interest giving rise to polynucleotides which comprise sequences homologous to 
eukaryotic genomic sequences, preferably mammalian genomic sequences, or viral 
genomic sequences. This will allow the introduction of the polynucleotides of the 

25 invention into the genome of eukaryotic cells or viruses by homologous 

recombination. In particular, a plasmid vector comprising the expression cassette 
flanked by viral sequences can be used to prepare a viral vector suitable for 
delivering the polynucleotides of the invention to a mammalian cell. Other examples 
of suitable viral vectors include herpes simplex viral vectors and retroviruses, 

30 including lentiviruses, adenoviruses, adeno-associated viruses and HPV viruses. 
Gene transfer techniques using these viruses are known to those skilled in the art. 
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Retrovirus vectors for example may be used to stably integrate the polynucleotide 
giving rise to the polynucleotide into the host genome. Replication-defective 
adenovirus vectors by contrast remain episomal and therefore allow transient 
expression. . 

5 Such expression vectors may be used to identify ligands of interest, i.e. 

molecules that bind to the peptide library member by standard binding assays such as 
ELISA, or enzymatic assays where appropriate substrates give, for example a colour 
change, light emission or fluorescence. Other functional assays could be used, where 
available. 

10 In an alternative embodiment, a DNA idientified ny a method of the invention 

may be cloned into a non-expression vector. Such a vector may be used to further 
characterise the DNA, for example by sequencing. 

Alternatively, ligands of interest may be identified without cloning. 
Examples of suitable methods include the in vitro expression of individual DNA 

15 sequences recovered from a screening method of the invention, and sequencing of 
individual DNAs recovered from such a screening method. Such individual DNA 
sequences may optionally be amplified. 

The invention also includes cells that have been modified to express a peptide 
identified by a method of the invention, for example by introducing an expression 

20 vector as described above into the cell. Such cells include transient, or preferably 
stable higher eukaryotic cell lines, such as mammalian cells or insect cells, using for 
example a baculovirus expression system, lower eukaryotic cells, such as yeast or 
prokaryotic cells such as bacterial cells. Particular examples of cells which may be 
modified by insertion of vectors encoding for a peptide identified by a method of the 

25 invention include mammalian HEK293T, CHO, HeLa and COS cells. Preferably the 
cell line selected will be one which is not only stable, but also allows for mature 
glycosylation and cell surface expression of the peptide. Expression may be achieved 
in transformed oocytes. A peptide identified by a method of the invention may be 
expressed in cells of a transgenic non-human animal, preferably a mouse. A peptide 

30 identified by a method of the invention may also be expressed in Xenopus laevis 
oocytes or melanophores. 



WO 2004/022746 



CT7GB2003/003860 



27 



10 



15 



20 



25 



111 order that the hwention is more fully understood, embodiments will now be 
described in more detail by way of example only and not by way of limitation with 
reference to the figures below. 

Examples of some of the embodiments of the invention are given below: 

Materials and Methods 

The following procedures used by the present applicant are described in 
Sambrook, J., et al., 19S9 supra.: analysis of restriction enzyme digestion products on 
agarose gels, DNA purification using phenol/chloroform stock solutions, preparation 
of phosphate buffered saline. 

General purpose reagents were purchased from SIGMA-Aldrich Ltd (Poole, 
Dorset, U.K.). Oligonucleotides were obtained from SIGMA-Genosys Ltd 
(Cambridgeshire, U.K.). Amino acids, and S30 extracts were obtained from Promega 
Ltd (Southampton, Hampshire, U.K.). Deep Vent and Taq DNA polymerases were 
obtained from New England Biolabs (Cambridgeshire, U.K.). Taqplus DNA 
polymerase was obtained from Stratagene Inc. (Amsterdam, Netherlands). 
GeneClean DNA gel purification kits were obtained from BIO101 (La Jolla, 
California, U.S.A.), anti-human IgK antibodies from Immunologicals Direct Ltd 
(Oxfordshire, U.K.), anti-c-rnyc polyclonal from Vector Labs Inc (Cambridgeshire 
U.K.), and anti-V5 antibody from Abeam Ltd (Cambridgeshire U.K.). Superblock 
blocking agent was obtained from Perbio Science (Cheshire, U.K.). 

Example 1, Isolation of specific cis acting protein-DNA complexes 

The in vitro expression constructs were prepared by sequentially adding the 
TAC promoter, the c-myc epitope, either the human kappa constant region or the V5 
epitope to the RepA-CIS-ORI region, by PCR amplification. Such constructs can be 
prepared by many methods known to one skilled in the art, for example, by 
amplifiying different fragments of DNA followed by assembly PCR. In this example, 
the initial amplification template was the Rl plasmid which contains the RepA-CIS- 
ORI region (Masai, H. and Arai, K.(1988). DNAs Res. 16, 6493-6514). 
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(a) Primary amplification. The RepA-CIS-ORI region was PCR amplified 
from a single colony of the strain ECO K12 harbouring plasmid Rl using 12.5pmol 
of each of the primers REPAFOR (SEQ ID 01) and ORIREV (SEQ ID 02) in a 50pl 



lx PCR reaction buffer (Stratagene Inc, Amsterdam, Netherlands). The REPAFOR 
primer anneals to the 5' -end of the Rep A coding region. The ORIREV primer 
anneals to the 3' -end of the non-coding ORI region. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 45 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40|al sterile water using a Geneclean II kit according to the manufacturers 
instructions (Biol 01, La Jolla, California, U.S.A.). 

(b) Secondary amplification. One jal (500 pg) of 100 times diluted gel-purified 
primary reaction product was re-amplified using 12.5pmol of each of the primers 
CKREPFOR (SEQ ID 03) and ORIREV (SEQ ID 02) in a 50^1 reaction containing 
0.25mM l dNTPs, 2.5 units Taqplus Precision DNA polymerase, and lx PCR reaction 
buffer (Stratagene Inc, Amsterdam, Netherlands). The CKREPFOR primer anneals to 
the 5 '-end of the primary reaction product and appends the 3' part of the kappa 
constant region DNA. The ORIREV primer anneals to the 3 '-end of the primary 
reaction product. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 2 minutes, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40j.il sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(c) Third amplification. One (500 pg) of 100 times diluted gel-purified 
primary reaction product was re : amplified using 12.5pmol of each of the primers 
V5REPFOR (SEQ ID 04) and ORIREV (SEQ ID 02) in a 50jxl reaction containing 
0.25mM dNTPs, 2.5 units Taqplus Precision DNA polymerase, and lx PCR reaction 



reaction containing 0.25mM dNTPs, 2.5 units Taqplus Precision DNA polymerase, 
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• buffer (Stratagene Inc, Amsterdam, Netherlands). The V5REPFOR primer anneals to 
the -5*-end of the primary reaction product and appends the 3' part of the V5 epitope 
DNA. The ORIREV primer anneals to the 3 '-end of the primary reaction product. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
5 minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 2 minutes, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40j.il sterile water using a Geneclean II kit according to the manufacturers 
instructions (Biol 01, La Jolla, California, U.S.A.). 

1 0 (d) Fourth amplification. One \x\ (500 pg) of 1 00 times diluted pCKV5 

plasmid using 12.5pmol of each of the primers MYCCKFOR (SEQ ID 05) and 
CKREV (SEQ ID 06) in a 50\x\ reaction containing 0.25mM dNTPs, 2.5 units 
Taqplus Precision DNA polymerase, and lx PCR reaction buffer (Stratagene Inc, 
Amsterdam, Netherlands). The pCKV5 plasmid contains the human kappa constant 

15 region cDNA (McGregor DP, Molloy PE, Cunningham C, & Harris WJ. 1994 Mol. 
Immunol. 3 1 : 21 9-26) and the V5 epitope DNA (Southern JA, Young DF, Heaney F, 
Baumgartner WK, Randall RE. 1991 J. Gen. Virol. 72: 1551-7). The MYCCKFOR 
primer anneals to the 5 '-end of the kappa constant region DNA and appends the V 
part of the MYC epitope DNA. The CKREV primer anneals to the 3 '-end of the 

20 kappa constant region DNA. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 2 minutes, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 

25 the gel into 40pl sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(e) Fifth amplification. One jal (500 pg) of 100 times diluted pCKV5 plasmid 
using 12.5pmol of each of the primers MYCV5FOR (SEQ ID 07) and V5REV (SEQ 
ID 08) in a 50|il reaction containing 0.25mM dNTPs, 2.5 units Taqplus Precision 

30 DNA polymerase, and 1 x PCR reaction buffer (Stratagene Inc, Amsterdam, 

Netherlands). The MYCV5FOR primer anneals to the 5'-end of the V5 epitope DNA 
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and appends the 3' part of the MYC epitope DNA. The V5REV primer anneals to the 
S'-end of the V5 epitope DNA. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
5 - seconds; 72°C, 30 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40|xl sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl , La Jolla, California, U.S.A.). 

(f) Sixth amplification. One ^il (500 pg) of 100 times diluted pTACP2A 
10 plasmid (ref) using 12.5pmol of each of the primers TAC3 (SEQ ID 09) and 

MYCTACREV (SEQ ID 10) in a 50fil reaction containing 0.25mM dNTPs, 2.5 units 
Taqplus Precision DNA polymerase, and lx PCR reaction buffer (Stratagene Inc, 
Amsterdam, Netherlands). The TAC3 primer anneals to the 5 '-end of the TAC 
promoter DNA. The MYCTACREV primer anneals to the 3 '-end of the TAC 

15 promoter DNA and appends the 5' part of the MYC epitope DNA. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 30 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 

20 the gel into 40(il sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.); 

(g) First assembly PCR. One |il (50 ng) of each of the reaction products in (f) 
and (d) using 50 pmol of each of the primers TAC5 (SEQ ID 1 1) and CKREV (SEQ 
ID 06) in a 50|il reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent DNA 

25 * polymerase mixture (20:1), and lx PCR reaction buffer (New England Biolabs, 
Beverly, MA, U.S.A.). The TAC5 primer anneals to the 5'-end of the reaction 
product (f) and adds 20 nucleotides. The CKREV primer anneals to the 3 '-end of the 
reaction product (d). 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 

30 minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 45 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
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products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40^1 sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Joila, California, U.S.A.). 

(h) Second assembly PCR. One \xl (50 ng) of each of the reaction products in 
5 (f) and (e) using 50 pmol of each of the primers TAC5 (SEQ ID 1 1) and V5REV 

(SEQ ID 08) in a 50|al reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent 
DNA polymerase mixture (20:1), and lx PCR reaction buffer (New England Biolabs, 
Beverly, MA, U.S.A.). The TAC5 primer anneals to the 5'-end of the reaction 
product (f) and adds 20 nucleotides. The V5REV primer anneals to the 3 '-end of the 

1 0 reaction product (e) . 

PCR reactions were earned out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C ? 45 seconds; 60°C, 45 
seconds; 72°C, 45 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 

1 5 the gel into 40^x1 sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(i) Third assembly PCR. One jil (50 ng) of each of the reaction products in (b) 
and (g) or using 50 pmol of each of the primers TAC3 (SEQ ID 09) and ORIREV 
(SEQ ID 02) in a 50[xl reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent 

20 DNA polymerase mixture (20: 1), and 1 x PCR reaction buffer (New England Biolabs, 
Beverly, MA, U.S.A.). The TAC3 primer anneals 20 nucleotides downstream to the 
5'-end of the reaction product (g). The ORIREV primer anneals to the 3'-end of the 
reaction product (b). The reaction product in (i) is called TAC-MYC-CK-REPA-CIS- 
ORI (SEQ ID 12). 

25 PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 

minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40fxl sterile water using a Geneclean II kit according to the manufacturers 

30 instructions (Bio 1 01 , La Jolla, California, U.S.A.). 
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(j) Fourth assembly PCR. One \xl (50 ng) of each of the reaction products in 
(b) and (h) or using 50 prnol of each of the primers TAC3 (SEQ ID 09) and 
ORIREV (SEQ ID 02) in a 50|il reaction containing 0.25mM dNTPs, 2.5 units 
TaqDeepVent DNA polymerase mixture (20:1), and Ix PCR reaction buffer (New 
England Biolabs, Beverly, MA, U.S.A.). The TAC3 primer anneals 20 nucleotides 
downstream to the 5'-end of the reaction product (g). The ORIREV primer anneals to 
the 3 '-end of the reaction product (b). The reaction product in (i) is called TAC- 
MYC-V5-REPA-CIS-ORI (SEQ ID 13). 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 2 
minutes and 15 seconds at 94°C followed by 30 cycles of 94°C, 45 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40pl sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

Preparation of in vitro transcription/translation reaction. The reaction was set 
up on ice, using a Promega bacterial linear template S30 coupled in vitro 
transcription/translation reaction kit as follows: 

20jal TAC-MYC-CK-REPA-CIS-ORI template (0.5^ig of final construct DNA SEQ 
ID 012 above); 20^1 TAC-MYC-V5-REPA-CIS-ORI template (0.5^ig of final 
construct DNA SEQ ID 013 above); 20|il complete amino acid mix (Promega); 80jal 
S30 Premix; 60jal S30 mix; 

and the reaction was allowed to proceed at 25 °C for 30 minutes and placed on ice, 
then diluted 10 fold with blocking buffer (Superblock (Perbio Ltd), 0.1 % Tween 20, 
200|ig/ml herring sperm DNA). 

DNA-protein complex capture. NUNC star immunotubes were coated with 
lOfig/ml of either anti-c-myc antibody, anti-V5 antibody, or anti-human kappa chain 
antibody, in 500|il PBS per tube overnight at 4°C. An additional tube was left blank 
as a negative control. Tubes were washed 2x PBS and blocked for 1 hour at room 
temperature with Superbloclc/PBS/O.lmg/ml herring sperm DNA/ 0.1% Tween 20 
and then washed 2x PBS. 500pl of diluted transcription/translation reaction was 
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added to each tube and incubated at room temperature for 1 hour. Tubes were washed 
5x PBS/0.1% Tween 20, then Ix 30 minutes with 2ml Superblock/PBS/O.lmg/ml 
herring sperm DNA/ 0.1% Tween 20, then 5x PBS. DNA was recovered with 300p.l 
T.E. buffer plus 300(^1 phenol/chloroform for 5 minutes with shaking. This was 
5 centrifuged at 1 3,200g for 5 minutes and DNA precipitated with 0.5 volume of 7.5M 
ammonium acetate, 20\xg glycogen and three volumes of absolute ethanol. Following 
centrifugation, pellets were washed with 70% ethanol, vacuum dried and resuspended 
in 20jil water. 10(il of recovered DNA was reamplified in 50jal reactions with TAC3 
(SEQ ID 09) and ORIREV (SEQ ID 02) primers. Reaction products were 
10 . electrophoresed on a 1% agarose/TAE gel (Figure 5). 

Example 2. Separating the RepA-DNA complex 

The two in vitro expression constructs (SEQID12 and SEQID13) already 
described in example 1 were used in a selection experiment against anti-human C- 
15 kappa antibody as described in Example 1, except that DNA was recovered and 
released from RepA by using either of following methods; Glycine, Triethylamine, 
Phenol/Chloroform, Proteinase K, and EDTA. These methods are described below. 

Glycine: tube was incubated with 500jil of 200mM Glycine, 150mM NaCl 
(pH2.0) for 10 minutes. The glycine eluate was then transferred to a fresh eppendorf 
20 tube and 50pl of 2M Tris (pH 8.5) added. 

Triethylamine: the tube was incubated 500^1 of 0.1 M Triethylamine for 10 
minutes and the triethylamine eluate was then transferred to a fresh eppendorf tube 
and 250^1 of 1M Tris (pH 7.4) added. 

Phenol/Chloroform: as example 1 above. 
25 Proteinase K: the tube was incubated with 500|al of lOOmM Tris (pH 8.0), 10 

inM EDTA (pH 8.0), 0.5% SDS for 30 minutes at 37°C. The Proteinase K eluate was 
then transferred to a fresh eppendorf tube. 

EDTA: the tube was incubated with 250|ii of 10mM Tris (pH 8.0), 1 mM 
EDTA 500mM NaCl and 250^1 of Phenol/Chloroform for 5 minutes. The EDTA 
30 eluate was then transferred to a fresh eppendorf tube. 
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After recovery of DNA the DNA was Phenol/Chloroform extracted, where 
appropriate, followed by Ethanoi precipitation as described in Example 1. lOul of 
resuspended DNA was reamplified in 50ul reactions with TAC3 (SEQID09) and 
CISREV (SEQID019) primers. The CISREV primer anneals 196 bases upstream of 
5 the binding site of ORIREV (SEQID02). Reaction products were electrophoresed on 
a 1% agarose/TAE gel (data not shown). Only the CK-DNA containing construct 
(SEQID 12) was amplified, in approximately equivalent amounts. 

This not only tells us that any of the methods described above for recovering 
and releasing DNA from RepA can be used, but this result also suggests that RepA 
1 0 interacts in a non-covalent manner with its cognate DNA. 

Example 3, Detection of specific anti-V5 binders in a V5-spiking expeiment 
using CIS display technology. 

The in vitro expression constructs were prepared by adding the TAC 
1 5 promoter and either the V5 epitope or a 12-mer NNB library to the RepA-CIS-ORI 
region, by PCR amplification. Such constructs can be prepared by many methods 
known to one skilled in the art, for example, by amplifiying different fragments of 
DNA followed by assembly PCR. In this example, the initial amplification template 
was the Rl plasmid which contains the RepA-CIS-ORI region (Masai, H. and Arai, 
20 K.(1988). Nucleic Acids Res. 16, 6493-6514). 

(a). Primary amplification. The RepA-CIS-ORI region was PCR amplified 
from a single colony of the strain ECO K12 harbouring plasmid Rl using 12.5pmol 
of each of the primers REPAFOR (SEQ ID 01) and ORIREV408 (SEQ ID 20) in a 
50pl reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent DNA polymerase 
25 mixture (20:1), and lx PCR reaction buffer (New England Biolabs, Beverly, MA, 
U.S.A.). The REPAFOR primer anneals to the 5'-end of the RepA coding region. 
The ORIREV408 primer anneals to the downstream of the 3 '-end of the non-coding 
ORI region. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 
30 minutes and 30 seconds of 94°C followed by 25 cycles of 94°C, 30 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 
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products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40jal sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(b) . Secondary amplification. One \xl (500 pg) of 100 times diluted gel- 
5 purified primary reaction product was re-amplified using 12.5pmol of each of the 

primers V5(NNB)REPFOR (SEQ ID 21) and ORIREV408 (SEQ ID 20) in a 50^1 
reaction containing 0.25mM dNTPs, 2.5 units TaqDeepVent DNA polymerase 
mixture (20: 1), and lx PCR reaction buffer (New England Biolabs, Beverly, MA, 
U.S.A.). The V5(NNB)REPFOR primer anneals to the 5'-end of the primary reaction 

10 product and appends the V5 epitope DNA. The ORIREV408 primer anneals to the 
3 ' -end of the primary reaction product. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 
minutes and 30 seconds of 94°C followed by 25 cycles of 94°C, 30 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72°C. Reaction 

1 5 products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40^1 sterile water using a Geneclean II kit according to the manufacturers 
instructions (Bio 1 0 1 , La Jolla, California, U.S.A.). 

(c) . Third amplification. One \x\ (500 pg) of 1 00 times diluted gel-purified 
primary reaction product was re-amplified using 12.5pmol of each of the primers 

20 NNBREPFOR (SEQ ID 22) and ORIREV408 (SEQ ID 20) in a 50^1 reaction 

containing 0.25mM dNTPs, 2.5 units TaqDeepVent DNA polymerase mixture (20:1), 
and lx PCR reaction buffer (New England Biolabs, Beverly, MA, U.S.A.). The 
NNBREPFOR primer anneals to the 5 '-end of the primary reaction product and 
appends a random amino acid 12-mer NNB library DNA. The ORIREV408 primer 

25 anneals to the 3' -end of the primary reaction product. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 4 
minutes and 30 seconds of 94°C followed by 25 cycles of 94°C, 30 seconds; 60°C, 45 
seconds; 72°C, 1 minute, followed by a single cycle 10 minutes at 72 8 C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 

30 the gel into 40jxl sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 
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(d) . Fourth amplification. One (500 pg) of 100 times diluted pTACP2A 
plasmid (ref) using 12.5pmol of each of the primers TACFARUP (SEQ ID 23) and 
TACREV (SEQ ID 27) in a 50^1 reaction containing 0.25mM dNTPs, 2.5 units 
TaqDeepVent DNA polymerase mixture (20:1), and Ix PCR reaction buffer (New 

5 England Biolabs, Beverly, MA, U.S.A.). The TACFARUP primer anneals to the 5'- 
end of the TAC promoter DNA. The TACREV primer anneals to the 3' -end of the 
TAC promoter DNA. 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 1 
minutes and 45 seconds of 94°C followed by 25 cycles of 94°C, 15 seconds; 60°C, 30 
10 seconds; 72°C, 30 seconds, followed by a single cycle 10 minutes at 72°C. Reaction 
products were electrophoresed on an agarose gel, excised and products purified from 
the gel into 40jil sterile water using a Geneclean II kit according to the manufacturers 
instructions (BiolOl, La Jolla, California, U.S.A.). 

(e) . First assembly PCR. One ^1 (50 ng) of each of the reaction products in 
15 (b) and (d) using 50 pmol of each of the primers TACFARUP (SEQ ID 23) and " 

ORIREV408 (SEQ ID 20) in a 50\xl reaction containing 0.25mM dNTPs, 2.5 units 
TaqDeepVent DNA polymerase mixture (20:1), and lx PCR reaction buffer (New 
England Biolabs, Beverly, MA, U.S.A.). The TACFARUP primer anneals to the 5'- 
end of the reaction product (d). The ORIREV480 primer anneals to the 3' -end of the 

20 reaction product (b). The reaction product in (e) is called TAC-V5-REPA-CIS-ORI- 
408 (SEQ ID 24). . 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 1 
minutes and 45 seconds of 94°C followed by 25 cycles of 94°C, 15 seconds; 60°C, 30 
seconds; 72°C, 1 minute and 30 seconds, followed by a single cycle 10 minutes at 

25 72°C. Reaction products were electrophoresed on an agarose gel, excised and 
products purified from the gel into 40^1 sterile water using a Geneclean II kit 
according to the manufacturers instructions (BiolOl, La Jolla, California, U.S.A.). 

(f) . Second assembly PCR. One \x\ (50 ng) of each of the reaction 
products in (c) and (d) using 50 pmol of each of the primers TACFARUP (SEQ ID 

30 23) and ORIREV408 (SEQ ID 20) in a 50|il reaction containing 0.25mM dNTPs, 2.5 
units TaqDeepVent DNA polymerase mixture (20:1), and lx PCR reaction buffer 
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(New England Biolabs, Beverly, MA, U.S.A.). The TACFARUP primer anneals to 
the 5'-end of the reaction product (d). The ORIREV480 primer anneals to the 3'-end 
of the reaction product (c). 

PCR reactions were carried out on a Eppendorf Master Cycler for 1 cycle of 1 

5 minutes and 45 seconds of 94°C followed by 25 cycles of 94°C, 15 seconds; 60°C, 30 
seconds; 72°C, 1 minute and 30 seconds, followed by a single cycle 10 minutes at 
72°C. Reaction products were electrophoresed on an agarose gel, excised and 
products purified from the gel into 40fii sterile water using a Geneclean II kit 
according to the manufacturers instructions (BiolOl, La Jolla, California, U.S.A.). 

10 The reaction product in (f) is called TAC-NNB-REPA-CIS-ORI-408 (SEQ ID 25). 

Preparation of in vitro transcription/translation reaction: The reaction set was 
set up on ice, using a Promega bacterial linear template S30 coupled in viti'o 
transcription/translation reaction kit as follows: 

20[i\ of 5000 times diluted TAC-V5-REPA-CIS-ORI-40S template (0.1 ng of final 
1 5 construct DNASEQ ID 24 above) 

20|il of 5 TAC-NNB-REPA-CIS-ORI-408 template (0.5jag of final construct 

DNASEQ ID 25 above) 

20^x1 complete amino acid mix (Promega) 

80jal S30 Premix 
20 60^1 S30 mix 

and the reaction was allowed to proceed at 25 °C for 30 minutes and placed on 

ice, then diluted 10 fold with 2% Marvel/PBS. 

DNA-protein complex capture. NUNC star immunotubes were coated with 

lOjxg/ml of anti-V5 antibody in 500pl PBS overnight at 4°C. An additional tube was 
25 left blank as a negative control. Tubes were washed 2x PBS and blocked for 1 hour at 

room temperatue with blocking buffer (2% Marvel, 0.1% Tween 20, O.lmg/ml 

herring sperm DNA) and then washed 2x PBS. 1 ml of diluted 

transcription/translaiton reaction was added to each tube and incubated at room 

temperature for 1 hour. Tubes were washed 5x PBS/0.1% Tween 20 and then 5x 
30 PBS. DNA was recovered with SOOjxl TE buffer plus 500|il phenol/chloroform. This 

was centrifuged at 13,200g for 5 minutes and DNA precipitated with 1/10 volume of 
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3M sodium acetate, 50jig/ml glycogen and two voulmes of absolute ethanol. 
Following centrifugation, pellets were washed with 70% ethanol, vacuum dried and 
resuspended in 40jxi water. 20 \x\ of recovered DNA was reamplified in 50 f-tl 
reactions with the biotinylated'primers bTAC6 (SEQ ID 26) and bCISREV (SEQ ID 

5 19). Reaction products were electrophoresed on a 1% agarose/TAE gel. 

Cloning of recovered DNA into the expression vector pDMG-K (SEQ ID 27). 
Reaction product were gelpurified and eluted with 50|.il sterile water using a 
QIAquick Gelextracation kit according to the manufacturers instructions (QIAGEN 
LtdWest Sussex, U.K.). Both the purified reaction product and the plasmid pDMG-K 

10 were digested with 20 units of Ncol and NotI (New England Biolabs, Beverly, MA, 
U.S.A.). The cut plasmid was gelpurified using a QIAquick Gelextracation kit 
according to the manufacturers instructions (QIAGEN LtdWest Sussex, U.K.), then 
treated with 0.01 units of Calf Intestinal Alkaline Phosphatase (Promega, 
Southampton, U.K.) followed by phenol/chloroform extraction and ethanol 

15 precipitation as described above. Precipitated DNA was dissolved in 20|xl of water. 
The cut PCR product was transferred to Streptavidin coated strips (Roche 
Diagnostics Ltd, East Sussex, U.K.) in lx TBS, 0.3 mg/ml BSA, 0.1% Tween 20 and 
incubated for 30 minutes at room temperature, shaking. This approach removes the 
flanking biotinylated DNA upstream and downstream of the Ncol and NotI site of the 

20 PCR product and enables recovery of the small DNA fragment containing the 

selected peptide sequence. Supernatant was phenol/chloroform extracted and ethanol 
precipitated as described above. Precipitated DNA was dissolved in lOjxl of water. 
Cut plasmid and the isloated small DNA fragment containing the selected peptide 
sequence, both having Ncol and NotI overhangs, were ligated using a Quick ligation 

25 kit according to the manufacturers instructions (New England Biolabs, Beverly, MA, 
U.S.A.) followed by phenol/chloroform extraction and ethanol precipitation as 
described above. Precipitated DNA was dissolved in lOjxl of water and electroporated 
into electrocompetent TGI cells according to the manufacturers instructions 
(Stratagene, U.S.A.) and selected on plates with 2xTY, 100|ig/ml ampicillin, and 2% 

30 glucose. 
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Anti-V5 antibody ELISA screening of selected clones. 8S colonies were 
picked into 400^.1 of 2x TY, 2% glucose, and 100ng/ml ampicillin and grown 
overnight at 37*C, shaking 300 rpm. 50|o,l of the overnight cultures were transferred 
into 1ml of 2x TY, 2% glucose, and 100p.g/ml ampicillin and grown at 37°C, shaking 

5 300 rpm until OD 0.5. Then the cells were centrifuged at lOOOx g for 10 minutes. 
The supernatants were discarded and pellets were resuspended in 600|ui of 2x TY, 
0.4M sucrose, lOOjig/ml ampicillin, and ImM IPTG and grown for 4 hours at 37*C, 
300 rpm. After induction the cells were centrifiiged at lOOOx g for 10 minutes. 150fxl 
of the supernatants were used in the ELISA test. NUNC Maxisorp plates were coated 

10 with 100|il of lug/ml in lx PBS of either anti-human kappa region antibody or anti- 
V5 antibody or 50jxg/ml of BSA for 7 hours at room temperature. An additional plate 
was left blank, only coated with PBS. Wells were rinsed 2x PBS followed by 
blocking for 1 hour at room temperature with 300p.l of 4% Marvel, 0.1% Tween in 
lx PBS. Wells were rinsed 2x PBS, then 150jxl of supernatant and lSOjal of 4% 

15 Marvel, 0.1% Tween 20 in lx PBS were added to wells and incubated for 1 hour at 
room temperature. Wells were then washed 2xPBS, 0.1% Tween 20 and 2x PBS. 
Secondary antibody anti-human kappa region antibody conjugated to horse radish 
peroxidase (HRP)(final concentration 1.6p,g/ml) was diluted 500 times in 4% Marvel, 
0.1% Tween 20, lx PBS and added to wells and incubated for 1 hour at room 

20 temperature. Wells were then washed 4x PBS, 0. 1% Twefen 20 and 2x PBS. The 
HRP signal was detected by adding 200^1 of TMB substrate. Reaction was stopped 
with 100(xl of 0.5M sulphuric acid. Absorbance was read at 450nm. 35 out of 88 
clones expressed well judged by HRP signal from clones screened against anti- 
human kappa region antibody. 7 out of these 35 clones showed specific binding to 

25 anti-V5 antibody, thereby enriching V5-peptides from 1 in 5000 to 1 in 5, i.e. an 
enrichment factor of 1000 (Figure 6). 

Example 4: CIS display Library construction, selection & screening against 
Bacillus globigii 
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Library Construction 

To generate library DNA, a promoter library DNA fragment and the RepA- 
ClS-ori fragment must be generated, then linked together by digestion-ligation. The 
tac promoter from a P2 A-HA vector was used in this example, but many available 

5 promoters could be used, and are well known to those skilled in the art. The initial 
PCR of Rep-CIS-ori and TAC fragments appends Bspl20I site and the random 
library/Notl site respectively. Two master mixes were prepared: 

10|il of 1 :50 diluted P2A-HA plasmid DNA (25ng/reaction) was PCR 
amplified in 20x 50^1 reaction volume containing 200|iM dNTPs, lxNEB 

1 0 polymerase amplification buffer (1 OmM KC1, 1 OmM (NH 4 ) 2 S0 4 , 20mM Tris-HCl 
pH 8.8, 2mM MgS04, 0.1% TritonX-100) with lOpmol of each of the primers 
TACFARUP (SEQ ID 23) and NTERM1 8MER (SEQ ID 28) primers and 2 units of 
20: 1 Taq DNA polymerase: Deep Vent DNA polymerase mixture (NEB) for 25 
cycles of 94°C, 40 seconds; 60°C, 40 seconds; 72°C, 60 seconds; followed by a 5 

1 5 minutes extension at 72°C. 20jil of reaction product were electrophoresed on a 1% 
agarose/TAE gel and photographed, while the remainder was Qiagen column purified 
into 200^1 water. 

10(il of Bspl20I corrected Rep-CIS-ori DNA (50ng/reaction) was PCR 
amplified in lOx 50|il reaction volume containing 200|iM dNTPs, lxNEB 

20 polymerase amplification buffer (lOmM KC1, lOmM (NH 4 )2S0 4 , 20mM Tris-HCl 
pH 8.8, 2mM MgS04, 0.1% TritonX-100) with lOpmol of each of the primers 
BSPREPAFOR (SEQ ID 29) and ORIREV (SEQ ID 02) primers and 2 units of 20:1 
Taq DNA polymerase: Deep Vent DNA polymerase mixture (NEB) for 30 cycles of 
94°C, 40 seconds; 60°C, 40 seconds; 72°C, 90 seconds; followed by a 5 minutes 

25 extension at 72°C. 20|il of reaction product were electrophoresed on a 1% 

agarose/TAE gel and photographed, while the remainder was Qiagen column purified 
into 120jil water. 

Library-TAC product was then digested with lOjil NotI (NEB) (lOOu) for 1 
hour at 37°C in a 300jal reaction volume, then Qiagen column purified into a 120j.il 
30 volume of water. The two products were then joined by restriction-ligation as 
follows: 
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IOxNEB buffer 4 


17JJ.1 


1 OOmM ATP f SIGMA} 


15|il 


10ma/ml acetvlated BSA fNEB") 

- X V/lllty JLi.ll Hvwl Y fCUvU JLJUiX ~ J 


IllI 


RepA DNA 


40li1 


TAC-library DNA 


40li1 


Bspl20I (10u/j.il Fermentas) 


5ji1 


NotI (10u/|ilNEB) 


5nl 


T4 DNA ligase (400u/^il NEB) 


S|il 


Water 


39li1 



Reaction was carried out at 37°C for two hours. 20[xl was assessed by gel 
electrophoresis, 30^1 was PGR amplified directly in lOx 50|il reactions, and the 
remainder was gel purified and the library band excised, Qiagen column purified and 
1 5 PCR amplified in 20x 50|il reactions with primers TACFAR4 (SEQ ID 30) and 

ORIREV (SEQ ID 02) for 30 cycles of 94°C, 40 seconds; 60°C, 40 seconds; 72°C, 
90 seconds; followed by a 5 minutes extension at 72°C. DNA was gel purified in 4 
Qiagen columns and the 200jil eluate pooled for ITT reactions/selection. 

20 Round 1 Selection 

2 x 200|il ITT reaction was set up and incubated at room temperature for 1 
hour as follows: 



REACTION 1 

25 Library DNA 56^1 (7^g) 

2.5x buffer 80|il 

lOOmM methionine 2fil 

S30 extract 60\x\ 



30 



lml of blocking buffer was added to each reaction (Block buffer is 4% 
Marvel, 100^g/ml sheared salmon sperm DNA, 0.1% Tween 20, 2.5mg/ml heparin, 
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in TBS), spun at 10,000g for 2 minutes, transferred to a fresh tube, then placed on 
ice. 

100 p.1 Bacillus globigii (Bg) spore suspension was washed twice with 1ml 
TBS/0.1% Tween 20 and was resuspended in 100|il of Block buffer. This was then 
5 added to the Block buffer and allowed to bind at room temperature for 1 hour whilst 
mixing. 

The mix was then centrifuged at 16,100g for 1 minute and the spore pellet 
was washed six times with 1 ml of TBS/0.1% Tween 20 by mixing with a pipette and 
vortexing prior to centrifugation. The pellet was finally washed in 1ml TBS and the 

1 0 supernatant was discarded. 

DNA was eluted by incubation of the spores in 120|il 0.5M sodium acetate 
pH5.5 for 10 minutes on a mixer. The spores were centrifuged at 16,100g for 1 
minute and the supernatant was neutralised by the addition of 120^1 Tris pH8.0 and 
then phenol/CHCB extracted for 5minutes at 16,100g. DNA was precipitated with 

15 20^g carrier glycogen and two and a half volumes of ethanol. DNA was pelleted at 
16,100g for 20 minutes and the pellet washed three times with 0.75ml 70% ethanol, 
centrifuging for 3 minutes at 16,100g in between each wash, then air dried and re- 
suspended in 20^1 water. 

IOjj.1 recovered DNA was PCR amplified in 10 * 50\x\ reaction with primers 

20 CISREV (SEQ ID 19) and TACFAR5 (SEQ ID 3 1) and 2 units of 20: 1 Taq DNA 
polymerase: Deep Vent DNA polymerase mixture (NEB) for 30 cycles of 94°C, 40 
seconds; 60°C, 40 seconds; 72°C, 90 seconds; followed by a 5 minutes extension at 
72°C. The DNA was purified, ethanol precipitated and re-suspended in lOjal water. 5 
\xl were further amplified by PCR using the conditions above but using the primers 

25 NOTRECREV2 (SEQ ID 32) and TACFAR5 (SEQ ID 31) for 10 cycles. The 

product was purified using a Qiagen PCR purification kit and eluted into 50\xl 5mM 
Tris pH 8.0. 

Restriction-Li gation 

30 This was carried out in a 30^1 reaction for 1 hour at 37°C to reattach RepA- 

ClS-ori DNA to recovered peptides for a further round of selection. 
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lOxNEB buffer 4 


3ul 


lOOmM ATP f SIGMA} 


1.5|al 


lOme/ml acetvlated BSA fNEB) 


0.3|il 


RepA DNA 


2|il 


TAC-library DNA 


10nl 


Bspl20I (10u/nl Femientas) 


1.5|il 


NotI(10u/filNEB) 


1.5|xl 


T4 DNA ligase (400u/nl NEB) 


1.5^1 


Water 


8.7^1 



20[il was PCR amplified directly in lOx 50jil reactions with primers 
TACFAR5.1 (SEQ ID 33) and ORIREV (SEQ ID 02) for 20 cycles of 94°C, 40 
seconds; 60°C, 40 seconds; 72°C, 90 seconds; followed by a 5 minutes extension at 
15 . 72°C. DNA was gel purified in 1 Qiagen column and the eluate used for Round 2 of 
ITT reactions/selection (58|il used in R2). 

Round 2 

Second round selection was earned out as for round 1 , with the following 
20 changes: Approximately 3ng of input DNA were used. Block buffer was 2% bovine 
serum albumin, 1% gelatin, 100|ag/ml sheared salmon sperm DNA, 2.5mg/ml 
heparin, in TBS. 10p.l washed spores used in each selection. Recovery PCRs used 
TACFAR5.2 (SEQ ID 34) and NOTRECREV2 (SEQ ID 32) primers. Finally, pull 
through PCR used TACFAR5.2 (SEQ ID 34) and ORIREV (SEQ ID 02) primers for 
25 10 cycles. 

Round 3 

Third round selection was earned out as for round 2, with the following 
changes: Approximately 2.5|ag of input DNA was used. Recovery PCRs used 
30 TACFAR6 (SEQ ID 35) and NOTRECREV2 (SEQ ID 32) primers. Finally, pull 
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through PCR used TACFAR6 (SEQ ID 35) and ORIREV (SEQ ID 02) primers for 
10 cycles. 



Round 4 

5 Round 4 was carried out as for round 3, except that approximately 2 \xg of 

input DNA was used for the selection. Recovery PCRs used TAC3 (SEQ ID 09) and 
NOTRECREV2 (SEQ ID 32) primers. Finally, pull through PCR used TAC3 (SEQ 
ID 09) and ORIREV (SEQ ID 02) primers for 10 cycles. 

10 Round 5 

Round 5 was carried out as for round 4. 

For cloning out as NcoI-NotI fragments, the stored the recovered DNA from 
round 5 was PCR amplified with biotinylated TAC6 (SEQ ID 26) primer and 
NOTRECREV2 (SEQ ID 32). Digestion with NotI was followed by purification 

1 5 using Qiagen PCR purification kit, digestion with Ncol followed by incubation in a 
plate coated with streptavidin. Following phenol/CHC13 purification and ethanol 
precipitation, the digested DNA was then ligated into a similarly digested pVIII 
phagemid vector and transformed into ER2738 E.coli, then plated on 2% glucose, 
2xTY, 100|ig/ml ampicillin plates and incubated o/n at 37°C. 

20 Individual colonies were picked into 200^1 2% glucose, 2xTY, 100|xg/ml 

ampicillin medium in 96 well plates, and grown at 37°C/200rpm for 6 hours. 100p.l 
was transferred to a deep-well plate containing 100fil 2% glucose, 2xTY, 100|ag/ml 
ampicillin plus lOjal M13K07 helper phage/well and incubated for 1 hour without 
• shaking at 37°C. 500(il per well of 2xTY, lOO^g/ml ampicillin/ 25|ig/ml kanamycin/ 

25 20|iM IPTG medium was added and incubation carried out o/n at 37°C/200rpm. 

ELISA screening 

Round bottom 96 well plates were blocked with 4% Marvel in 
TBS/0. l%Tween 20 in PBS for 1 hour at room temperature. Picked phage cultures 
30 were centrifuged at 3000g for 5 minutes and the phage supernatant was assayed in an 
ELISA. In each well 50jal of phage supernatant were mixed with 5\x\ Bg spores in 
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50(il 4% bovine serum albumin, 1% gelatin in TBS and incubated whilst shaking at 
room temperature for 1 hour. The wells were washed 5 * with 200|xl TBS/0.1% 
Tween20 by centrifugation at 3000g for 5 minutes in between each wash before 
incubation with anti-M13 horseradish peroxidase conjugated antibody 0.2|ag/ml in 
5 4% bovine serum albumin, 1% gelatin in TBS. The spores were incubated at room 
temperature for 1 hour whilst shaking. The wells were then washed 5 x with 
TBS/0.1% Tween20 and the spores were transferred into a fresh plate. The spores 
were then washed once with TBS as described above before development with TMB 
substrate. The development was stopped with 0.5M H2SO4 and the solution was 
10 transferred to a fresh flat-bottomed plate for reading at 450nm. Binding data for 
selected peptides is shown in Figure 7. 

Example 5. CIS display Library construction, selection & screening against 
anti-V5 antibody 

15 Library Construction was carried out as described in Example 4. 

Round 1 Selection 

Ix 200|il in vitro transcription/translation reaction (ITT) reaction was set up 
and incubated at room temperature for 1 hour as described in Example 4. 1ml of 
20 blocking buffer was added to each reaction (Block buffer is 5% skimmed milk 
powder, lOO^ig/ml sheared salmon sperm DNA, 2.5mg/ml heparin, in TBS), then 
placed on ice.. 

For the first round of library selection a 70x1 1mm NUNC Maxisorp 
Immunotube (Life Technologies, Paisley, Scotland U.K.) was coated with 1ml of 

25 10(ig/ml of polyclonal anti-V5 peptide antibody (Harlan-Seralab) in PBS for 1 hour 
at 37°C. The tube was rinsed three times with PBS (fill & empty) and blocked with 
3 mi block buffer for 1 hour at 37°C and washed as before. Library protein-DNA 
complexes in block buffer were added, and incubated for 1 hour standing at room 
temperature. The tube was washed five times with PBS/0.1% Tween 20, then a 

30 further five times with PBS only. 
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DNA was eluted into 500|ul 1M Sodium acetate pH 5.2 for 10 minutes on the 
blood mixer, neutralized with 100^x1 !M Tris-HCl pH 8.0, then phenol/CHC13 
extracted for Sminutes at 16,100g. DNA was precipitated with 20|ig carrier glycogen, 
X A volume 7.5M ammonium acetate, and three volumes of ethanol. DNA was pelleted 

5 at 16,100g for 20 minutes and the pellet washed with 0.5ml 70% ethanol for 5 
minutes at 16,100g then vacuum dried, and re-suspended in 20\xl water. 

10)-il recovered DNA was PCR amplified in lx 50|il reaction with primers 
NOT1RECREV2 (SEQ ID 32) and TACFAR4 (SEQ ID 30) and 2 units of 20:1 Taq 
DNA polymerase: Deep Vent DNA polymerase mixture (NEB) for 30 cycles of 

10 94°C, 40 seconds; 60°C, 40 seconds; 72°C, 90 seconds; followed by a 5 minutes 
extension at 72°C. 50^1 of reaction product were electrophoresed on a 1% 
agarose/TAE gel and photographed, then GeneClean purified into 10 yX water. DNA 
was reattached to RepA DNA and reamplified for round two as described in example 
4 using TACFAR5 (SEQ ID 31) and ORIREV (SEQ ID 02) primers. 

15 Second round selection was carried out as for round 1 , using the same primer 

pairs as described in example 4, with the following changes: Anti-V5 antibody 
coating concentration was reduced to Spg/ml. Input DNA was approximately 4^ig. 
Third round selection was carried out as for round 2, with the following changes: 
Approximately 4(ag of input DNA was used. Recovery PCRs used TACFAR5.1 

20 (SEQ ID 33) and NOTRECREV2 (SEQ ID 32) primers. Finally, pull through PCR 
used TACFAR6 (SEQ ID 35) and ORIREV (SEQ ID 02) primers for 10 cycles. 
Round 4 was carried out as for round 3. 

For cloning out as NcoI-NotI fragments, the stored the recovered DNA from 
round 4 the recovered DNA from round 4 was PCR amplified with biotinylated (SEQ 

25 ID 26) TAC6 and NOTIREPRECREV2 (SEQ ID 32) primers and cloned into pVIII 
phagemid vector and electroporated into electrocompetent TG-1 E.coli, as described 
in example 4. 

Individual colonies were picked into 200 \xl 2% glucose, 2xTY, 100(ig/ml 
ampicillin medium in 96 well plates, and grown at 37°C/200rpm for 6 hours. 100|il 
30 was transferred to a deep-well plate containing lOOjxl 2% glucose, 2xTY, lOOjig/ml 
ampicillin plus 10 9 kru M13K07 helper phage/well and incubated for 1 hour without 
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shaking at 37°C. 400jal per well of 2xTY, 100^g/ml ampicillin/ 25ng/ml kanamycin/ 
20(xM IPTG medium was added and phage amplification continued for 16 hours at 
37°C while shaking at 200rpm. Bacterial cultures were spun in microtitre plate 
carriers at 2000g for 10 minutes at 4°C in a benchtop centrifuge to pellet bacteria and 
5 culture supernatant used for ELISA. 

A NUNC Maxisorp ELISA plate was coated with lOOng/well anti-V5 peptide 
antibody in lOOjal /well PBS for one hour at 37°C. The plate was washed 
2x200|il/well PBS and blocked for 1 hour at 37°C with 200|il/well 2% BSA/PBS and 
then washed 2x200|il/well PBS. 50\i\ phage culture supernatant was added to each 

10 well containing 50(il/well 4% BSA/PBS, and allowed to bind for 1 hour at room 

temperature. The plate was washed two times with 200^1/well PBS/0.1% Tween 20, 
then two times with 200^1/well PBS. Bound phage were detected with lOO^l/well, 
1:5000 diluted anti-M13-HRP conjugate (Amersham-Pharmacia) in 2% BSA/PBS 
for 1 hour at room temperature and the plate washed four times as above. The plate 

15 was developed for 5 minutes at room temperature with 100}il/well TMB (3,3\5,5*- 
Tetramethylbenzidine) substrate buffer. The reaction was stopped with 100^1/well 
0.5N H 2 S0 4 and read at 450nm. Phagemid DNA of ELISA positive clones were then 
sequenced with standard pUC forward and reverse sequencing primers. The amino 
acid sequence of these clones isolated is shown below. Four ELISA positive clones 

20 were grown in 10ml culture volumes and phage particles precipitated with PEG-NaCl 
and re-suspended in 1ml PBS and 50^1 retested in ELISA as described above. 
OD450nm signals against anti-V5 and control anti-ACTH peptide antibody are 
shown in Figure 8. 

25 Peptide sequences isolated after selection: 

P1C12 (SEQ ID 36) CGCPTMAARVRP VLNSKH 

P2H1 (SEQ ID 37) MTTVPVLMISV 
P1B5(SEQID38) T L S T R H H N V I D R F N L RN F 

P2B8 (SEQ ID 39) S I RTL T G S TP A Q F D AT AD 



30 
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Example 6. Selection of ovalbumin binding peptides from a CIS display library 

For any selection methodology it is important that the selected entities are 
capable of binding to the target selected against, independently of the carrier 
molecule associated with it during selection and screening. In this example, selected 

5 peptides are selected and synthesized to allow confirmation of target binding. 

Random 12mer peptide library construction was carried out as described in Example 
3. Four rounds of selection were carried out as described in example 4 with 100|j,g/ml 
ovalbumin (SIGMA, Dorset, UK) coated onto immunotubes. 

For cloning out as NcoI-NotI fragments, the recovered DNA from round 4 

1 0 was PCR amplified with biotinylated TAC6 (SEQ ID 26) and NOTIREPRECREV2 
(SEQ ID 32) primers and cloned into a pVIII phagemid vector and electroporated into 
electrocompetent TG-1 Exoli, as described in example 4. 

Individual colonies were picked into 200|il 2% glucose, 2xTY, 100|ig/ml 
ampicillin medium in 96 well plates, and grown at 37°C/200rpm for 6 hours, \00\xl 

15 was transferred to a deep-well plate containing 100^1 2% glucose, 2xTY, 100^g/ml 
ampicillin plus 10 9 kru M13K07 helper phage/well and incubated for 1 hour without 
shaking at 37°C. 400\x\ per well of 2xTY, 100^g/ml ampicillin/ 25|ng/ml kanamycin/ 
20jiM IPTG medium was added and phage amplification continued for 16 hours at 
37°C while shaking at 200rpm. Bacterial cultures were spun in microtitre plate 

20 carriers at 2000g for 1 0 minutes at 4°C in a benchtop centrifuge to pellet bacteria and 
culture supernatant used for ELIS A. 

A NUNC Maxisorp ELIS A plate was coated with 100fag/well ovalbumin in 
IOOjlxI /well PBS overnight at 4°C. The plate was washed 2x200nl/well PBS and 
blocked for 1 hour at 37°C with 200^il/well 2% BSA/PBS and then washed 

25 2x200|j.l/well PBS. 50|j.l phage culture supernatant was added to each well containing 
50|j.l/well 4% BSA/PBS, and allowed to bind for 1 hour at room temperature. The 
plate was washed two times with 200|al/well PBS/0.1% Tween 20, then two times 
with 200jil/well PBS. Bound phage were detected with 100|il/well, 1 :5000 diluted 
anti-M13-HRP conjugate (Amersham-Pharmacia) in 2% BSA/PBS for 1 hour at 

30 room temperature and the plate washed four times as above. The plate was developed 
for 5 minutes at room temperature with 100|il/well TMB (3,3',5,5'- 
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Tetraniethylbenzidine) substrate buffer. Hie reaction was stopped with lOO^l/well 
0.5N H2SO4 and read at 450rmi. Phagemid DNA of ELISA positive clones were then 
sequenced with M13REV primer. The amino acid sequence of these clones isolated 
is shown below. 

5 

CI (SEQ ID 40) ANLWRIVLHGWW 
C4 (SEQ ID 41) VSFMLLGPHRHR 
C6 (SEQ ID 42) LVLHWLSLGSR 
C8 (SEQ ID 43) SNQVVLILHLRP 
1 0 Control (SEQ ID 44) AESWLHQSWIHL 



Peptide sequences from four representative ELISA positive clones were 
synthesized (SIGMA-Genosys Ltd) with biotin added to the C-terminus to aid 
detection in ELISA. These peptides were tested in ELISA against ovalbumin, along 

15 with a control peptide previously isolated by phage display selection against 

B.globigii spores. A NUNC Maxisorp ELISA plate was coated with 100[ig/well 
ovalbumin in lOOfxl /well PBS, or, 200ng/well ant-V5 polyclonal antibody in PBS, 
overnight at 4°C. The plate was washed 2x200jil/well PBS and blocked for 1 hour at 
37°C with 200nl/well 2% skimmed milk powder/PBS and then washed 2x200|il/well 

20 PBS. l\ig of diluted peptides were added to each well in lOOfil/well 2% BSA/PBS, 
and allowed to bind for 1 hour at room temperature. The plate was washed two times 
with 200jil/well PBS/0.1% Tween 20, then two times with 200^il/well PBS. Bound 
peptides were detected with 100|il/well, 1:2000 diluted streptavidin-HRP conjugate 
(Pierce) in 2% BSA/PBS for 1 hour at room temperature and the plate washed four 

25 times as above. The plate was developed for 5 minutes at room temperature with 

100^/well TMB (3,3\5,5'-Tetramethylbenzidine) substrate buffer. The reaction was 
stopped with. lOOjxl/well 0.5N H 2 S0 4 and read at 450nm (Figure 9). 



Example 7. Display of single-chain Fv antibody (scFv) fragments in a CIS 
30 display system 
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A tac-scFv-RepA-CIS-ori construct was constructed by PCR overlap 
extension essentially as described previously in example 1 . Anti-mecoprop scFv 
DNA (Haptogen Ltd, Aberdeen, UK) was amplified in a 50jil reaction volume 
containing 200^M dNTPs, lxNEB polymerase amplification buffer (lOmM KC1, 
5 lOmM (NH4) 2 S0 4 > 20mM Tris-HCl pH 8.8, 2mM MgS04, 0.1% TritonX-100) with 
lOpmol of each of the primers TACMECOFOR (SEQ ID 45) and REPAMECOBAK 
(SEQ ID 46) and 2 units of 20:1 Taq DNA polymerase: Deep Vent DNA polymerase 
mixture (NEB) for 30 cycles of 94°C, 40 seconds; 60°C, 40 seconds; 72°C, 80 
seconds; followed by a 5 minutes extension at 72°C. Products were electrophoresed 

10 on a 1% agarose/TAE gel and purified with a Geneclean II kit into 20|il water. This 
was assembled with RepA-CIS-ori DNA generated with ORIREV408 (SEQ ID 20) 
and MECOREPAFOR (SEQ ID 47), and Tac promoter DNA generated with 
TACFARUP (SEQ ID 23) and MECOTACB AK (SEQ ID 48) in a 50^1 reaction 
volume containing 200|iM dNTPs, lxNEB polymerase amplification buffer (lOmM 

15 KC1, lOmM (NH 4 )2S0 4 , 20mM Tris-HCl pH 8.8, 2mM MgS04, 0.1% TritonX-100) 
with lOpmol of each of the primers TAC3 (SEQ ID 09) and ORIREV (SEQ ID 02) 
and 2 units of 20:1 Taq DNA polymerase: Deep Vent DNA polymerase mixture 
(NEB) for 30 cycles of 94°C, 40 seconds; 60°C, 40 seconds; 72°C, 80 seconds; 
followed by a 5 minutes extension at 72°C. Products were electrophoresed on a 1% 

20 agarose/TAE gel and purified with a Geneclean II kit into 20jjl1 water. 

DNA was reamplified in lOx 5Q\xl reactions containing 200^M dNTPs, 
lxNEB polymerase amplification buffer (lOmM KC1, lOmM (NH4) 2 S0 4 , 20mM 
Tris-HCl pH 8.8, 2mM MgS04, 0.1% TritonX-100) with lOpmol of each of the 
primers TAC3 (SEQ ID 09) and ORIREV (SEQ ID 02) and 2 units of 20:1 Taq DNA 

25 polymerase: Deep Vent DNA polymerase mixture (NEB) for 30 cycles of 94°C, 40 
seconds; 60°C, 40 seconds; 72°C, 80 seconds; followed by a 5 minutes extension at 
72°C. Products were electrophoresed on a 1% agarose/TAE gel and purified with a 
Geneclean II kit into 100^1 water. 



30 ScFvDNA was then translated in the following two reaction conditions: 
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REACTION ' 1 2 

Tac-ScFv-RepA DNA 1 28jil(ljig) 28jnl (l|ag) 

2.5x buffer 40|al 40^1 

1 OmM methionine 1 (il 1 |al 

5 H 2 0 . Ifil 

S30 extract 30jal 30jal 



Reactions were incubated at 30°C for 30 minutes then 0.25M ox- 
glutathione added to reaction 2 and incubation at 30°C continued for a further 30 

10 . minutes. 1ml of blocking buffer was added to each reaction (Block buffer is 1% 

gelatin, lOOjig/ml sheared salmon sperm DNA, 2.5mg/ml heparin, in TBS), spun at 
10,000g for 2 minutes, then placed on ice. 

NUNC star immunotubes were coated with 0.5ml 10|ig/ml BSA-mecoprop 
conjugate, or 10^g/ml BSA in PBS for 1 hour at 37°C. Tubes were washed 2x PBS, 

15 then blocked for 1 hour at room temperature with 3ml blocking buffer on a blood 
mixer, then tubes were washed 2x PBS. 

0.5ml of each diluted ITT was added to either a blocked BSA coated or BSA- 
mecoprop coated tube and incubated at room temperature for 1 hour. Tubes were 
washed 5x TBS/0. 1% Tween 20, 5x TBS. 

20 Bound DNA was eluted for 10 minutes at room temperature with 0.5ml of 

0.5M NaCl/1 OmM Tris pH 8, 1 mM EDTA, then extracted with 0.5mL 
phenol/chloroform and precipitated with 20(ig carrier glycogen, Vi volume 7.5M 
ammonium acetate, and three volumes of ethanol. DNA was pelleted at 14 5 000g for 
20 minutes and the pellet washed with 0.5ml 70% ethanol for 5 minutes at 14,000g 

25 then vacuum dried, and re-suspended in 20jxl water. 

IOjlxI of recovered DNA was PCR amplified in a 50\x\ reaction volume 
containing 200 |liM dNTPs, lxNEB polymerase amplification buffer (lOmM KC1, 
lOmM (NH 4 ) 2 S0 4 , 20mM Tris-HCl pH 8.8, 2mM MgS04, 0.1% TritonX-100) with 
lOpmol of each of the primers TACMECOFOR (SEQ ID 45) and REPAMECOBAK 

30 (SEQ ID 46) and 2 units of 20:1 Taq DNA polymerase: Deep Vent DNA polymerase 
mixture (NEB) for 30 cycles of 94°C, 40 seconds; 60°C, 40 seconds; 72°C, 80 
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seconds; followed by a 5 minutes extension at 72°C. Products were electrophoresed 
on a 1% agarose/TAE gel and photographed. Greater amounts of DNA were 
observed from selections on antibody target than with recovered from BSA coated 
tubes, indicating that functional scFv-RepA-DNA complexes were being selected 
5 (Figure 10). 
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CLAIMS 

1 . A method for producing an in vitro peptide expression library 
comprising a plurality of peptides, wherein each peptide is linked to the DNA 
construct encoding the peptide, comprising the steps of: 

(a) providing a DNA construct comprising: 

(i) a DNA target sequence; 

(ii) DNA encoding a library member peptide; and 

(iii) DNA encoding a peptide capable of non-covalently binding 
directly or indirectly to said DNA target sequence of (i); 

wherein said DNA construct and encoded protein are selected to have 
cis-activity 

(b) expressing a plurality of DNA constructs according to (a) wherein said 
DNA constructs encode a plurality of library member peptides such 
that each expressed peptide is non-covalently linked to the DNA from 
which it was produced. 

2. A method according to claim 1 wherein said DNA construct further 
comprises: (iv) a DNA element that directs cis-activity. 

3. A method according to claim 2 wherein said DNA construct of (a) 
further comprises 

(v) DNA encoding a fragment comprising at least the C-terminal 
20 amino acids of a repA protein wherein said fragment is 
capable of interacting with said DNA element of (iv); 
wherein said DNA element of (iv) is located 3' to said DNA of (ii), (iii) and (v). 

4. A method according to any one of the preceding claims wherein the 
peptide encoded by said DNA of (iii) is capable of recognising and directly binding 
said DNA target sequence of (i). 



5. A method according to claim 4 wherein the peptide encoded by said 
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DNA of (iii) is a rep A protein and wherein said DNA target sequence of (i) is ori. 

6. A method according to claim 4 or 5 wherein said DNA of (ii) is linked 
to said DNA of (i) and (iii) by restriction enzyme digestion and ligation. 

7. A method according to any one of claims 3 to 6 wherein said rep A is 
selected from rep A of the IncI complex plasmids and repA of the IncF, IncB, IncK, 
IncZ and IncL/M plasmids. 

8. A method according to claim 5 wherein said DNA construct 
comprises the sequence encoding repA, the cis DNA element and the ori DNA of the 
IncFII plasmid Rl. 

9. . A method according to any one of the preceding claims 
wherein said rep A protein has the sequence given in SEQ ID NO: 16 and wherein 
said cis DNA element has the sequence given in SEQ ID NO: 17. 

10. A method according to any one of the preceding claims wherein DNA 
not bound by the peptide encoded by said DNA of (iii) is bound by non-specific 
DNA binding protein. 

11. A method according to claim 4 wherein the peptide encoded by said 
DNA of (iii) is an oestrogen receptor DNA binding domain and wherein said DNA 
target sequence of (i) is an oestrogen receptor target sequence. 

12. A method according to claim 9 wherein said DNA. binding domain 
comprises amino acids 176 to 282 of the oestrogen receptor DNA binding fragment 
and wherein said DNA target sequence comprises the oestrogen receptor target 
sequence given in SEQ ID NO: 14. 
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13. A method according to claim 1 , 2 or 3 wherein the peptide encoded by 
said DNA of (iii) indirectly binds said DNA target sequence of (i) via a bifunctional 
agent, one part of which binds said DNA target sequence of (i) and a second part of 
which binds the peptide encoded by said DNA of (iii). 

14. A method according to claim 13 wherein said DNA target sequence 
comprises a DNA tag capable of being bound by said bifunctional agent, said tag 
being optionally selected from biotin and fluorescein. 

15. A method according to claim 13 or 14 wherein the binding activities 
of said bifunctional agent are conferred by means of two antibodies or fragments 
thereof. 



16. A method according to claim 1 5 wherein one or both of said binding 
activities are conferred by means of an Fab fragment. 

17. A method according to any one of claims 13 to 16 wherein said 
bifunctional agent is provided prior to step (b). 

18. A method according to claim 1 3 wherein said bifunctional agent is 
bound to said DNA target sequence of (i) and is capable of binding to the peptide 
encoded by said DNA of (iii). 

19. A method according to claim 18 wherein said bifunctional agent is a 
polymer. 

20. A method according to any one of the preceding claims wherein said 
DNA is under the control of suitable promoter and translation sequences to allow for 
in vitro transcription and translation. 
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21. A method according to any one of the preceding claims wherein said 
library member peptide is an antibody or fragment thereof. 

22. A method according to any one of the preceding claims 
wherein said library comprises at least 10 4 molecules. 

23. A method according to any one of the preceding claims wherein 
said expression is earned out in the presence of a compound that prevents nuclease 
activity, or reduces non-specific DNA-protein or protein-protein interactions. 

24. A method according to any one of the preceding claims wherein said 
expression is carried out in a coupled bacterial transcription/translation environment. . 

25. A method according to claim 24 wherein said coupled bacterial 
transcription/translation environment is the S30 extract system. 

26. A method for producing an in vitro peptide expression library 
comprising a plurality of peptides, wherein each peptide is linked to the DNA 
construct encoding the peptide, comprising the steps of: 

(a) providing a DNA construct comprising: 

(i) DNA encoding a library member peptide; and 

(ii) DNA encoding a peptide capable of binding to a bifunctional 
agent; 

wherein said DNA construct and encoded protein are selected to have 
cis-activity; 

(b) binding a bifunctional agent or a DNA tag capable of binding a 
bifunctional agent to said DNA construct of (a), wherein said 
bifunctional agent is capable of binding to the peptide encoded by said 
DNA of(ii); and 

(c) expressing a plurality of DNA constructs according to (b), wherein 
said DNA constructs encode a plurality of library member peptides 
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such that each expressed peptide is linked via said Afunctional agent 
to the DNA from which it was produced. 

27. A method of identifying and/or purifying a peptide exhibiting desired 
properties from an in vitro peptide expression library produced according to the 
method of any one of the preceding claims, comprising at least the steps of (a) 
screening said library and (b) selecting and isolating the relevant library member. 

28. A method of identifying a specific ligand binding peptide, said 
method comprising at least the steps of (a) screening an in vitro peptide expression 
library produced according to the method of any one of claims 1 to 26 with ligand 
molecules which are optionally bound to a solid support; (b) selecting and isolating a 
library member binding to said ligand molecule; and (c) isolating the peptide which 
binds specifically to said ligand molecule. 

29. A method according to claim 27 or 28 wherein said library member 
peptides are antibodies or fragments thereof. 

30. .A method of identifying and/or purifying a peptide having the ability 
to bind a specific DNA target sequence comprising at least the steps of 

(a) providing an in vitro expression library according to any one of claims 
1 to 26 wherein the peptide encoded by the DNA of (iii) is a library 
member peptide having DNA binding activity and wherein said DNA 
target sequence of (i) is the target sequence of interest; 

(b) selecting and isolating a library member in which the encoded protein 
binds to said target sequence; and 

(c) isolating the peptide which binds to said target sequence. 



31. A method according to claim 30 wherein said library member peptides 
are zinc finger proteins, helix-loop-helix proteins or helix-turn-helix proteins. 
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32. A method according to any one of claims 27 to 3 1 wherein said 
screening and/or selecting step is carried out in the presence of a compound that 
prevents nuclease activity or reduces non-specific DNA-protein or protein-protein 
interactions. 

33. A method according to claim 32 wherein said compound is heparin. 

34. A method according to any one of claims 27 to 33 wherein 
additionally the DNA expressing said isolated peptide is isolated. 

35. A method according to claim 34 further comprising cloning said DNA 
into an expression vector. 

36. A method according to claim 35 further comprising instroducing said 
expression vector into a cell in vifro. 

37. A method according to claim 35 or 36 further comprising expressing 
the peptide encoded by said DNA. 

38. An in vitro peptide expression library produced according to the 
method of any one of claims 1 to 26. 



39. 



A DNA construct as described in any one of claims 1 to 26. 
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SEQUENCE LISTING 

<110> ISOGENICA LIMITED 

<120> PEPTIDE LIBRARY DISPLAY METHOD 

<130> N.86234C SER 

<150> GB 0220759.5 

<151> 2002-09-06 

<150> GB 0304521.8 

<151> 2003-02-27 

<150> GB 0304657.0 

<151> 2003-02-28 

<160> 48 

<170> Patentln version 3.1 



<210> 1 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 1 

actgatcttc accaaacgta tta 23 



<210> 2 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223>. Primer 

<400> 2 

tgcatatctg tctgtccaca gg 



<210> 3 

<211> 48 

<212> DNA 

<213> Artificial sequence 



WO 2004/022746 ^B>CT/GB2003/003860 



<220> 

<223> Primer 
<400> 3 

gagcttcaac aggggagggg gaggaggatc aactgatctt caccaaac 48 



<210> 4 

<211> 50 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 4 

ctaggactgg attcaacggg gggaggagga tcaactgatc ttcaccaaac 50 



<210> 5 

<211> 49 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 5 

cagaagagga tctgaatggg ggaggagggt ccactgtggc tgcaccatc 49 



<210> 6 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 6 

tcccctgttg aagctctttg tg 22 



<210> 7 

<211> 40 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer 
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<400> 7 

cagaagagga tctgaatggg ggaggagggt ccggaaaacc 40 



<210> 8 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 8 

gctacgttga atccagtcct aggagag 27 



<210> 9 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 9 

catattgtcg ttagaacgcg gc 22 



<210> 10 

<211> 58 

<212> DNA 

<213> Artificial sequence 

i 

<220> 

<223> Primer 

<400> 10 

attcagatcc tcttctgaga tgagtttttg ttcctcgagc atggtagatc ctgtttcc 58 



<210> 11 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 



<400> 11 

cgatacctag cgttcggatc catattgtcg ttagaacgcg gc 



42 



WO 2004/022746 




CT/GB2003/003860 



4 



<210> 12 
<211> 1788 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> DNA construct 
<400> 12 

catattgtcg ttagaacgcg gctacaatta atacataacc ttatgtatca tacacatacg. 60 

atttaggtga cactatagaa tacaagctta ctccccatcc ccctgttgac aattaatcat 120 

ggctcgtata atgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acaggatcta 180 

ccatgctcga ggaacaaaaa ctcatctcag aagaggatct gaatggggga ggagggtcca 240 

ctgtggctgc accatctgtc ttcatcttcc cgccatctga tgagcagttg aaatctggaa 300 

ctgcctctgt tgtgtgcctg ctgaataact tctatcccag agaggccaaa gtacagtgga 360 

aggtggataa cgccctccaa tcgggtaact cccaggagag tgtcacagag caggacagca 420 

aggacagcac ctacagcctc agcaacaccc tgacgctgag caaagcagac tacgagaaac 480 

acaaagtcta cgcctgcgaa gtcacccatc agggcctgag ctcgcccgtc acaaagagct 540 

tcaacagggg agggggagga ggatcaactg atcttcacca aacgtattac cgccaggtaa 600 

agaacccgaa tccggtgttc actccccgtg aaggtgccgg aacgccgaag ttccgcgaaa 660 

aaccgatgga aaaggcggtg ggcctcacct cccgttttga tttcgccatt catgtggcgc 720 

atgcccgttc ccgtggtctg cgtcggcgca tgccaccggt gctgcgtcga cgggctattg 780 

atgcgctgct gcaggggctg tgtttccact atgacccgct ggccaaccgc gtccagtgtt 840 

ccatcaccac actggccatt gagtgcggac tggcgacaga gtccggtgca ggaaaactct 900 

ccatcaeccg tgccacccgg gccctgacgt tcctgtcaga gctgggactg attacctacc 960 

agacggaata tgacccgctt atcgggtgct acattccgac cgacatcacg ttcacactgg 1020 

ctctgtttgc tgcccttgat gtgtctgagg atgcagtggc agctgcgcgc cgcagtcgtg 1080 

ttgaatggga aaacaaacag cgcaaaaagc aggggctgga taccctgggt atggatgagc 1140 

tgatagcgaa agcctggcgt tttgtgcgtg agcgtttccg cagttaccag acagagcttc 1200 

agtcccgtgg aataaaacgt gcccgtgcgc gtcgtgatgc gaacagagaa cgtcaggata 1260 

tcgtcaccct agtgaaacgg cagctgacgc gtgaaatctc ggaaggacgc ttcactgcta 1320 

atggtgaggc ggtaaaacgc gaagtggagc gtcgtgtgaa ggagcgcatg attctgtcac 1380 

gtaaccgcaa ttacagccgg ctggccacag cttctccctg aaagtgatct cctcagaata 1440 

atccggcctg cgccggaggc atccgcacgc ctgaagcccg ccggtgcaca aaaaaacagc 1500 

gtcgcatgca aaaaacaatc tcatcatcca ccttctggag catccgattc cccctgtttt 1560 

taatacaaaa tacgcctcag cgacggggaa ttttgcttat ccacatttaa ctgcaaggga 1620 

cttccccata aggttacaac cgttcatgtc ataaagcgcc agccgccagt cttacagggt 1680 

gcaatgtatc ttttaaacac ctgtttatat ctcctttaaa ctacttaatt acattcattt 1740 

aaaaagaaaa cctattcact gcctgtcctg tggacagaca gatatgca 1788 



<210> 13 

<211> 1518 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> DNA construct 
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<400> 13 

catattgtcg ttagaacgcg gctacaatta atacataacc ttatgtatca tacacatacg 60 

atttaggtga cactatagaa tacaagctta ctccccatcc ccctgttgac aattaatcat 120 

ggctcgtata atgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acaggatcta 180 

ccatgctcga ggaacaaaaa ctcatctcag aagaggatct gaatggggga ggagggtccg 240 

gaaaacctat cccaaaccct ctcctaggac tggattcaac ggggggagga ggatcaactg 300 

atcttcacca aacgtattac cgccaggtaa agaacccgaa tccggtgttc actccccgtg 360 

aaggtgccgg aacgccgaag ttccgcgaaa aaccgatgga aaaggcggtg ggcctcacct 420 

cccgttttga tttcgccatt catgtggcgc atgcccgttc ccgtggtctg cgtcggcgca 480 

tgccaccggt gctgcgtcga cgggctattg atgcgctgct gcaggggctg tgtttccact 540 

atgacccgct ggccaaccgc gtccagtgtt ccatcaccac actggccatt gagtgcggac 600 

tggcgacaga gtccggtgca ggaaaactct ccatcacccg tgccacccgg gccctgacgt 660 

tcctgtcaga gctgggactg attacctacc agacggaata tgacccgctt atcgggtgct 720 

acattccgac cgacatcacg ttcacactgg ctctgtttgc tgcccttgat gtgtctgagg 780 

atgcagtggc agctgcgcgc cgcagtcgtg ttgaatggga aaacaaacag cgcaaaaagc 840 

aggggctgga taccctgggt atggatgagc tgatagcgaa agcctggcgt tttgtgcgtg 900 

agcgtttccg cagttaccag acagagcttc agtcccgtgg aataaaacgt gcccgtgcgc 960 

gtcgtgatgc gaacagagaa cgtcaggata tcgtcaccct agtgaaacgg cagctgacgc 1020 

gtgaaatctc ggaaggacgc ttcactgcta atggtgaggc ggtaaaacgc gaagtggagc 1080 

gtcgtgtgaa ggagcgcatg attctgtcac gtaaccgcaa ttacagccgg ctggccacag 1140 

cttctccctg aaagtgatct cctcagaata atccggcctg cgccggaggc atccgcacgc 1200 

ctgaagcccg ccggtgcaca aaaaaacagc gtcgcatgca aaaaacaatc tcatcatcca 1260 

ccttctggag catccgattc cccctgtttt taatacaaaa tacgcctcag cgacggggaa 1320 

ttttgcttat ccacatttaa ctgcaaggga cttccccata aggttacaac cgttcatgtc 1380 

ataaagcgcc agccgccagt cttacagggt gcaatgtatc ttttaaacac ctgtttatat 1440 

ctcctttaaa ctacttaatt acattcattt aaaaagaaaa cctattcact gcctgtcctg 1500 

tggacagaca gatatgca 1518 



<210> 14 

<21i> 38 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Estrogen Receptor Target Recognition Sequence 
<400> 14 

tcaggtcaga gtgacctgag ctaaaataac acattcag 38 



<210> 15 

<211> 828 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> repA sequence 



<220> 
<221> CDS 
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<222> (1)..(828) 
<223> 

<400> 15 

atg gta aag aac ccg aat ccg gtg ttc act ccc cgt gaa ggt gcc gga 48 

Met Val Lys Asn Pro Asn Pro Val Phe Thr Pro Arg Glu Gly Ala Gly 
1 5 10 15 

acg ccg aag ttc cgc gaa aaa ccg atg gaa aag gcg gtg ggc etc acc 96 
Thr Pro Lys Phe Arg Glu Lys Pro Met Glu Lys Ala Val Gly Leu Thr 
20 25 30 

tec cgt ttt gat ttc gcc att cat gtg gcg cat gcc cgt tec cgt ggt 144 
Ser Arg Phe Asp Phe Ala He His Val Ala His Ala Arg Ser Arg Gly 
35 40 45 

ctg cgt egg cgc atg cca ccg gtg ctg cgt cga egg get att gat gcg 192 
Leu Arg Arg Arg Met Pro Pro Val Leu Arg Arg Arg Ala He Asp Ala 
50 55 60 

ctg ctg cag ggg ctg tgt ttc cac tat gac ccg ctg gcc aac cgc gtc 240 
Leu Leu Gin Gly Leu Cys Phe His Tyr Asp Pro Leu Ala Asn Arg Val 
65 70 75 80 

cag tgt tec ate acc aca ctg gcc att gag tgc gga ctg gcg aca gag 288 
Gin Cys Ser He Thr Thr Leu Ala He Glu Cys Gly Leu Ala Thr Glu 
85 90 95 

tec ggt gca gga aaa etc tec ate acc cgt gcc acc egg gcc ctg acg 336 
Ser Gly Ala Gly Lys Leu Ser He Thr Arg Ala Thr Arg Ala Leu Thr 
100 105 110 

ttc ctg tea gag ctg gga ctg att acc tac cag acg gaa tat gac ccg 384 
Phe Leu Ser Glu Leu Gly Leu He Thr Tyr Gin Thr Glu Tyr Asp Pro 
115 120 125 

ctt ate ggg tgc tac att ccg acc gac ate acg ttc aca ctg get ctg 432 
Leu He Gly Cys Tyr He Pro Thr Asp He Thr Phe Thr Leu Ala Leu 
130 135 140 

ttt get gcc ctt gat gtg tct gag gat gca gtg gca get gcg cgc cgc 480 
Phe Ala Ala Leu Asp Val Ser Glu Asp Ala Val Ala Ala Ala Arg Arg 
145 150 155 160 

agt cgt gtt gaa tgg gaa aac aaa cag cgc aaa aag cag ggg ctg gat 528 
Ser Arg Val Glu Trp Glu Asn Lys Gin Arg Lys Lys Gin Gly Leu Asp 
165 170 175 

acc ctg ggt atg gat gag ctg ata gcg aaa gcc tgg cgt ttt gtg cgt 576 
Thr Leu Gly Met Asp Glu Leu He Ala Lys Ala Trp Arg Phe Val Arg 
180 185 190 
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gag cgt ttc cgc agt tac cag aca gag ctt cag tec cgt gga ata aaa 624 
Glu Arg Phe Arg Ser Tyr Gin Thr Glu Leu Gin Ser Arg Gly He Lys 
195 200 205 

cgt gec cgt gcg cgt cgt gat gcg aac aga gaa cgt cag gat ate gtc 672 
Arg Ala Arg Ala Arg Arg Asp Ala Asn Arg Glu Arg Gin Asp He Val 
210 215 220 

ace eta gtg aaa egg cag ctg acg cgt gaa ate teg gaa gga cgc ttc 720 
Thr Leu Val Lys Arg Gin Leu Thr Arg Glu lie Ser Glu Gly Arg Phe 
225 230 235 240 

act get aat ggt gag gcg gta aaa cgc gaa gtg gag cgt cgt gtg aag 768 
Thr Ala Asn Gly Glu Ala Val Lys Arg Glu Val Glu Arg Arg Val Lys 
245 250 255 

gag cgc atg att ctg tea cgt aac cgc aat tac age egg ctg gee aca 816 
Glu Arg Met He Leu Ser Arg Asn Arg Asn Tyr Ser Arg Leu Ala Thr 
260 265 270 

get tct ccc tga 828 
Ala Ser Pro 
275 



<210> 16 

<211> 275 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> repA sequence 

<400> 16 

Met Val Lys Asn Pro Asn Pro Val Phe Thr Pro Arg Glu Gly Ala Gly 
1 5 10 15 

Thr Pro Lys Phe Arg Glu Lys Pro Met Glu Lys Ala Val Gly Leu Thr 
20 25 30 

Ser Arg Phe Asp Phe Ala He His Val Ala His Ala Arg Ser Arg Gly 
35 40 45 

Leu Arg Arg Arg Met Pro Pro Val Leu Arg Arg Arg Ala He Asp Ala 
50 55 60 

Leu Leu Gin Gly Leu Cys Phe His Tyr Asp Pro Leu Ala Asn Arg Val 
65 70 75 80 
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Gin Cys Ser He Thr Thr Leu Ala He Glu Gys Gly Leu Ala Thr Glu 
85 90 ' 95 

Ser Gly Ala Gly Lys Leu Ser lie Thr Arg Ala Thr Arg Ala Leu Thr 
100 105 110 

Phe Leu Ser Glu Leu Gly Leu He Thr Tyr Gin Thr Glu Tyr Asp Pro' 
115 120 125 

Leu He Gly Cys Tyr lie Pro Thr Asp He Thr Phe Thr Leu Ala Leu 
130 135 140 

Phe Ala Ala Leu Asp Val Ser Glu Asp Ala Val Ala Ala Ala Arg Arg 
145 150 155 160 

Ser Arg Val Glu Trp Glu Asn Lys Gin Arg Lys Lys Gin Gly Leu Asp 
165 170 175 

Thr Leu Gly Met Asp Glu Leu He Ala Lys Ala Trp Arg Phe Val Arg 
180 185 190 

Glu Arg Phe Arg Ser Tyr Gin Thr Glu Leu Gin Ser Arg Gly He Lys 
195 200 205 

Arg Ala Arg Ala Arg Arg Asp Ala Asn Arg Glu Arg Gin Asp He Val 
210 215 220 

Thr Leu Val Lys Arg Gin Leu Thr Arg Glu He Ser Glu Gly Arg Phe 
225 230 235 240 

Thr Ala Asn Gly Glu Ala Val Lys Arg Glu Val Glu Arg Arg Val Lys 
245 250 255 

Glu Arg Met lie Leu Ser Arg Asn Arg Asn Tyr Ser Arg Leu Ala Thr 
260 265 270 

Ala Ser Pro 





275 


<210> 


17 


<211> 


172 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


CIS DNA element 


<400> 


17 



aagtgatctc ctcagaataa tccggcctgc gccggaggca tccgcacgcc tgaagcccgc 60 
cggtgcacaa aaaaacagcg tcgcatgcaa aaaacaatct catcatccac cttctggagc 120 
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atccgattcc ccctgttttt aatacaaaat acgcctcagc gacggggaat tt 172 



<210> 18 
<211> 195 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> ori sequence 
<400> 18 

tgcttatcca catttaactg caagggactt ccccataagg ttacaaccgt tcatgtcata 60 
aagcgccagc cgccagtctt acagggtgca atgtatcttt taaacacctg tttatatctc 120 
ctttaaacta cttaattaca ttcatttaaa aagaaaacct attcactgcc tgtcctgtgg 180 
acagacagat atgca 195 



<210> 19 . 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 19 

aattccccgt cgctgaggcg 20 



<210> 20 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 20 

cgtaagccgg tactgattga 20 



<210> 21 

<211> 110 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer 
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<400> 21 

cacaggaaac aggatctacc atggccggaa aacctatccc aaaccctctc ctaggactgg 60 
attcaacggg gggaggagga tcagcggccg caactgatct tcaccaaacg 110 



<210> 22 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 
<220> 

<221> mi sc_f eature 

<222> (29).. (30) 

<223> n = a, g. c or t 

<220> 

<221> mi sc_f eature 

<222> (32).. (33). 

<223> n = a, g, c or t 

<220> 

<221> mi sc f eature 

<222> (35) . . (36) 

<223> n = a, g. c or t 

<220> 

<221> mi sc_f eature 

<222> (38) . . (39) 

<223> n = a. g, c or t 

<220> 

<221> mi sc_feature 

<222> (41) . . (42) 

<223> n - a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (44).. (45) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_feature 

<222> (47) . . (48) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (50).. (51) 
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<223> n = a, g, c or t 
<220> 

<221> mi sc_f eature 

<222> (53).. (54) 

<223> n = a, g. c or t 

<220> 

<221> mi sc_f eature 

<222> (56).. (57) 

<223> n = a. g, c or t 

<220> 

<221> mi sc_feature 

<222> (59).. (60) 

<223> n = a. g, c or t 

<220> 

<221> mi sc_feature 
<222> (62).. (63) 

<223> n = a. g, c or t 

<400> 22 

cacacaggaa acaggatcta ccatggccnn bnnbnnbnnb nnbnnbnnbn nbnnbnnbnn 60 
bnnbggggga ggaggatcag cggccgcaac tgatcttcac caaacg 106 



<210> 23 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 23 

cagttgatcg gcgcgagatt 



<210> 24 
<211> 2390 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> TAC-V5-REPA-CIS-ORI-408 construct 
<400> 24 

cagttgatcg gcgcgagatt taatcgccgc gacaatttgc gacggcgcgt gcagggccag 60 
actggaggtg gcaacgccaa tcagcaacga ctgtttgccc gccagttgtt gtgccacgcg 120 
gttgggaatg taattcagct ccgccatcgc cgcttccact ttttcccgcg ttttcgcaga 180 
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aacgtggctg gcctggttca ccacgcggga aacggtctga taagagacac cggcatactc 240 

tgcgacatcg tataacgtta ctggtttcac attcaccacc ctgaattgac tctcttccgg 300 

gcgctatcat gccataccgc gaaaggtttt gcaccattcg gctagcgatg accctgctga 360 

ttggttcgct gaccatttcc ggggtgcgga acggcgttac cagaaactca gaaggttcgt 420 

ccaaccaaac cgactctgac ggcagtttac gagagagatg atagggtctg cttcagtaag 480 

ccagatgcta cacaattagg cttgtacata ttgtcgttag aacgcggcta caattaatac 540 

ataaccttat gtatcataca catacgattt aggtgacact atagaataca agcttactcc 600 

ccatccccct gttgacaatt aatcatggct cgtataatgt gtggaattgt gagcggataa 660 

caatttcaca caggaaacag gatctaccat ggccggaaaa cctatcccaa accctctcct 720 

aggactggat tcaacggggg gaggaggatc agcggccgca actgatcttc accaaacgta 780 

ttaccgccag gtaaagaacc cgaatccggt gtttacaccc cgtgaaggtg caggaacgct 840 

gaagttctgc gaaaaactga tggaaaaggc ggtgggcttc acttcccgtt ttgatttcgc 900 

cattcatgtg gcgcatgccc gttcgcgtgg tctgcgtcga cgcatgccac cagtgctgcg 960 

tcgacgggct attgatgcgc tcctgcaggg gctgtgtttc cactatgacc cgctggccaa 1020 

ccgcgtccag tgctccatca ccacgctggc cattgagtgc ggactggcga cggagtctgc 1080 

tgccggaaaa ctctccatca cccgtgccac ccgggccctg acgttcctgt cagagctggg 1140 

actgattacc taccagacgg aatatgaccc gcttatcggg tgctacattc cgaccgatat 1200 

cacgttcaca tctgcactgt ttgctgccct cgatgtatca gaggaggcag tggccgccgc 1260 

gcgccgcagc cgtgtggtat gggaaaacaa acaacgcaaa aagcaggggc tggataccct 1320 

gggcatggat gaactgatag cgaaagcctg gcgttttgtt cgtgagcgtt ttcgcagtta 1380 

tcagacagag cttaagtccc ggggaataaa gcgtgcccgt gcgcgtcgtg atgcggacag 1440 

ggaacgtcag gatattgtca ccctggtgaa acggcagctg acgcgcgaaa tcgcggaagg 1500 

gcgcttcact gccaatcgtg aggcggtaaa acgcgaagtt gagcgtcgtg tgaaggagcg 1560 

catgattctg tcacgtaacc gtaattacag ccggctggcc acagcttccc cctgaaagtg 1620 

acctcctctg aataatccgg cctgcgccgg aggcttccgc acgtctgaag cccgacagcg 1680 

cacaaaaaat cagcaccaca tacaaaaaac aacctcatca tccagcttct ggtgcatccg 1740 

gccccccctg ttttcgatac aaaacacgcc tcacagacgg ggaattttgc ttatccacat 1800 

taaactgcaa gggacttccc cataaggtta caaccgttca tgtcataaag cgccatccgc 1860 

cagcgttaca gggtgcaatg tatcttttaa acacctgttt atatctcctt taaactactt 1920 

aattacattc atttaaaaag aaaacctatt cactgcctgt cctgtggaca gacagatatg 1980 

cacctcccac cgcaagcggc gggcccctac cggagccgct ttagttacaa cactcagaca 2040 

caaccaccag aaaaaccccg gtccagcgca gaactgaaac cacaaagccc ctccctcata 2100 

actgaaaagc ggccccgccc cggcccgaag ggccggaaca gagtcgcttt taattatgaa 2160 

tgttgtaact acttcatcat cgctgtcagt cttctcgctg gaagttctca gtacacgctc 2220 

gtaagcggcc ctgacggccc gctaacgcgg agatacgccc cgacttcggg taaaccctcg 2280 

tcgggaccac tccgaccgcg cacagaagct ctctcatggc tgaaagcggg tatggtctgg 2340 

cagggctggg gatgggtaag gtgaaatcta tcaatcagta ccggcttacg 2390 



<210> 25 

<211> 2384 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> TAC-NNB-REPA-CIS-ORI-408 construct 
<220> 

<221> mi sc_f eature 

<222> (695).. (696) 

<223> n = a, g. c or t 
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<220> 

<221> mi sc_f eature 

<222> (698) . . (699) 

<223> n = a. g, c or t 

<220> 

<221> mi sc_feature 

<222> (701).. (702) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (704) . . (705) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_feature 

<222> (707).. (708) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_feature 

<222> (710).. (711) 

<223> n = a, g. c or t 

<220> 

<221> mi sc_feature 

<222> (713).. (714) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_feature 

<222> (716).. (717) 

<223> n = a, g. c or t 

<220> 

<221> mi sc_f eature 

<222> (719).. (720) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (722) . . (723) 

<223> n = a. g. c or t 

<220> 

<221> mi sc_f eature 

<222> (725) . . (726) 
<223> n - a, g, c or t 
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<220> 

<221> mi sc_f eature 
<222> (728).. (729) 
<223> n = a, g, c or t 

<400> 25 

cagttgatcg gcgcgagatt taatcgccgc gacaatttgc gacggcgcgt gcagggccag 60 

actggaggtg gcaacgccaa tcagcaacga ctgtttgccc gccagttgtt gtgccacgcg 120 

gttgggaatg taattcagct ccgccatcgc cgcttccact ttttcccgcg ttttcgcaga 180 

aacgtggctg gcctggttca ccacgcggga aacggtctga taaga'gacac cggcatactc 240 

tgcgacatcg tataacgtta ctggtttcac attcaccacc ctgaattgac tctcttccgg 300 

gcgctatcat gccataccgc gaaaggtttt gcaccattcg gctagcgatg accctgctga 360 

ttggttcgct gaccatttcc ggggtgcgga acggcgttac cagaaactca gaaggttcgt 420 

ccaaccaaac cgactctgac ggcagtttac gagagagatg atagggtctg cttcagtaag 480 

ccagatgcta cacaattagg cttgtacata ttgtcgttag aacgcggcta caattaatac 540 

ataaccttat gtatcataca catacgattt aggtgacact atagaataca agcttactcc 600 

ccatccccct gttgacaatt aatcatggct cgtataatgt gtggaattgt gagcggataa 660 

caatttcaca caggaaacag gatctaccat ggccnnbnnb nnbnnbnnbn nbnnbnnbnn 720 

bnnbnnbnnb gggggaggag gatcagcggc cgcaactgat cttcaccaaa cgtattaccg 780 

ccaggtaaag aacccgaatc cggtgtttac accccgtgaa ggtgcaggaa cgctgaagtt 840 

ctgcgaaaaa ctgatggaaa aggcggtggg cttcacttcc cgttttgatt tcgccattca 900 

tgtggcgcat gcccgttcgc gtggtctgcg tcgacgcatg ccaccagtgc tgcgtcgacg 960 

ggctattgat gcgctcctgc aggggctgtg tttccactat gacccgctgg ccaaccgcgt 1020 

ccagtgctcc atcaccacgc tggccattga gtgcggactg gcgacggagt ctgctgccgg 1080 

aaaactctcc atcacccgtg ccacccgggc cctgacgttc ctgtcagagc tgggactgat 1140 

tacctaccag acggaatatg acccgcttat cgggtgctac attccgaccg atatcacgtt 1200 

cacatctgca ctgtttgctg ccctcgatgt atcagaggag gcagtggccg ccgcgcgccg 1260 

cagccgtgtg gtatgggaaa acaaacaacg caaaaagcag gggctggata ccctgggcat 1320 

ggatgaactg atagcgaaag cctggcgttt tgttcgtgag cgttttcgca gttatcagac 1380 

agagcttaag tcccggggaa taaagcgtgc ccgtgcgcgt cgtgatgcgg acagggaacg 1440 

tcaggatatt gtcaccctgg tgaaacggca gctgacgcgc gaaatcgcgg aagggcgctt 1500 

cactgccaat cgtgaggcgg taaaacgcga agttgagcgt cgtgtgaagg agcgcatgat 1560 

tctgtcacgt aaccgtaatt acagccggct ggccacagct tccccctgaa agtgacctcc 1620 

tctgaataat ccggcctgcg ccggaggctt ccgcacgtct gaagcccgac agcgcacaaa 1680 

aaatcagcac cacatacaaa aaacaacctc atcatccagc ttctggtgca tccggccccc 1740 

cctgttttcg atacaaaaca cgcctcacag acggggaatt ttgcttatcc acattaaact 1800 

gcaagggact tccccataag gttacaaccg ttcatgtcat aaagcgccat ccgccagcgt 1860 

tacagggtgc aatgtatctt ttaaacacct gtttatatct cctttaaact acttaattac 1920 

attcatttaa aaagaaaacc tattcactgc ctgtcctgtg gacagacaga tatgcacctc 1980 

ccaccgcaag cggcgggccc ctaccggagc cgctttagtt acaacactca gacacaacca 2040 

ccagaaaaac cccggtccag cgcagaactg aaaccacaaa gcccctccct cataactgaa 2100 

aagcggcccc gccccggccc gaagggccgg aacagagtcg cttttaatta tgaatgttgt 2160 

aactacttca tcatcgctgt cagtcttctc gctggaagtt ctcagtacac gctcgtaagc 2220 

ggccctgacg gcccgctaac gcggagatac gccccgactt cgggtaaacc ctcgtcggga 2280 

ccactccgac cgcgcacaga agctctctca tggctgaaag cgggtatggt ctggcagggc 2340 

tggggatggg taaggtgaaa tctatcaatc agtaccggct tacg 2384 



<210> 26 
<211> 26 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Primer 
<400> 26 

ccccatcccc ctgttgacaa ttaatc 26 



<210> 27 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 27 

ggtagatcct gtttcctgtg tg 22 



<210> 28 

<211> 110 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 
<220> 

<221> mi sc_f eature 

<222> (37) . . (38) . 

<223> n = a. g. c or t 



<220> 

<221> mi sc_f eature 

<222> (40) . . (41) 

<223> n = a, g. c or t 

<220> 

<221> mi sc_f eature 

<222> (43).. (44) 

<223> n = a. g, c or t 

<220> 

<221> mi sc_f eature 

<222> (46) . . (47) 

<223> n = a, g. c or t 
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<220> 

<221> mi sc_feature 

<222> (49) . . (50) 

<223> n =■ a. g, c or t 

<220> 

<221> mi sc_feature 

<222> (52).. (53) 

<223> n - a, g, c or t 

<220> 

<221> mi sc_feature 

<222> (55) . . (56) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_feature 

<222> (58) . . (59) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (61) . . (62) 

<223> n - a, g, c or t 

<220> 

<22 1> mi sc_f eat u re 

<222> (64), . (65) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (67) . . (68) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 

<222> (70) . . (71) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 
<222> (73).. (74) 

<223> n = a, g, c or t 

<220> 

<221> mi sc_f eature 
<222> (76).. (77) 

<223> n = a. g, c or t 
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<220> 

<221> misc_feature 

<222> (79).. (80) 

<223> n = a. g. c or t 

<220> 

<221> mi scfeature 

<222> (82) . . (83) 

<223> n = a, g, c or t 

<220> 

<221> mi sc feature 

<222> (85).. (86) 

<223> n = a, g. c or t 

<220> 

<221> mi sc f eature 

<222> (88) . . (89) 

<223> n = a, g, c or t 

<400> 28 

acataccgtc atgcggccgc tgatcctcct cccccvnnvn nvnnvnnvnn vnnvnnvnnv 60 
nnvnnvnnvn nvnnvnnvnn vnnvnnvnng gccatggtag atcctgtttc 110 



<210> 29 

<211> 49 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 29 

ctggagatgg catcaagggc cccaactgat cttcaccaaa cgtattacc 49 



<210> 30 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer 



<400> 30 

ggcgctatca tgccataccg 



20 
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<210> 31 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 
<400> 31 

accattcggc tagcgatgac 20 



<210> 32 

<211> 33 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 32 

ggtga agate agttgcggcc gctgatcctc etc 33 



<210> 33 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 33 

gattggttcg ctgaccattt cc 22 



<210> 34 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 34 

cggcgttacc agaaactcag a 21 



<210> 35 
<211> 20 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Primer 
<400> 35 

aaccgactct gacggcagtt 20 



<210> 36 

<211> 18 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 

<400> 36 

Cys Gly Cys Pro Thr Met Ala Ala Arg Val Arg Pro Val Leu Asn Ser 
i 5 10 15 

Lys Hjs 



<210> 37 
<211> 11 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 
<400> 37 

Met Thr Thr Val Pro Val Leu Met lie Ser Val 
15 10 



<210> 38 

<211> 18 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 

<400> 38 

Thr Leu Ser Thr Arg His His Asn Val He Asp Arg Phe Asn Leu Arg 
1 5 10 15 



Asn Phe 
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<210> 39 

<211> 18 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 

<400> 39 

Ser He Arg Thr Leu Thr Gly Ser Thr Pro Ala Gin Phe Asp Ala Thr 
1 5 10 15 

Ala Asp 



<210> 40 

<211> 12 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 

<400> 40 

Ala Asn Leu Trp Arg He Val Leu His Gly Trp Trp 
1 5 10 



<210> 41 

<211> 12 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 

<400> 41 

Val Ser Phe Met Leu Leu Gly Pro His Arg His Arg 
1 5 10 



<210> 42 

<211> 11 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Peptide 
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<400> 42 

Leu Val Leu His Trp Leu Ser Leu Gly Ser Arg 
1 5 10 



<210> 43 

<211> 12 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 

<400> 43 

Ser Asn Gin Val Val Leu He Leu His Leu Arg Pro 
1 5 10 



<210> 44 

<211> 12 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Peptide 

<400> 44 

Ala Glu Ser Trp Leu His Gin Ser Trp He His Leu 
1 5 10 



<210> 45 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 45 

caggaaacag gatctaccat gctgcaggag tcaggacctg ag 



<210> 46 

<211> 53 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer 
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<400> 46 

gtttggtgaa gatcagttga tcctcctccc ccccgctcga ggaagatgga tac 53 

<210> 47 

<211> 53 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 47 

gtatccatct tcctcgagcg ggggggagga ggatcaactg atcttcacca aac 53 

<210> 48 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 



<400> 48 

ctcaggtcct gactcctgca gcatggtaga tcctgtttcc tg 



42 
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